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Description 
INTRODUCTION 
5 Technical Field 

Genetic modification of plant material is provided for seed specific transcription. Production of 
endogenous products may be modulated or new capabilities provided. 

to Background 

The primary emphasis in genetic modification has been directed to prokaryotes and mammalian cells. 
For a variety of reasons plants have proven more intransigent than other eukaryotic cells in the ability to 
genetically manipulate the plants. In part, this has been the result of the different goals involved, since for 
rs the most part plant modification has been directed to modifying the entire plant or a particular plant part in 
a live plant, as distinct from modifying cells in culture. 

For many applications, it will be desirable to provide for transcription in a particular plant part or at a 
particular time in the growth cycle of the plant. Toward this end, there is a substantial interest in identifying 
endogenous plant products whose transcription or expression are regulated in a manner of interest In 
20 identifying such products, one must first look for products which appear at a particular time in the cell 
growth cycle or in a particular plant part, demonstrate its absence at other times or in other parts, identify 
nucleic acid sequences associated with the product and then identify the sequence in the genome of the 
plant in order to obtain the 5'-untranslated sequence associated with transcription. This requires substantial 
investigation in first identifying the particular sequence, followed by establishing that it is the correct 
25 sequence and isolating the desired transcriptional regulatory region. One must then prepare appropriate 
constructs, followed by demonstration that the constructs are efficacious in the desired manner. 

Identifying such sequences is a challenging project, subject to substantial pitfalls and uncertainty. There 
is, however, substantial interest in being able to genetically modify plants, which justifies the substantial 
expenditures and efforts in identifying transcriptional sequences and manipulating them to determine their 
30 utility. 

Relevant Literature 

Crouch et al., In: Molecular Form and Function of the Plant Genome , eds van Vloten-Doting, Groot and 
35 Hall, Plenum Publishing Corp. 1985, pp 555-566; Crouch and Sussex. Planta (1981) 153:64-74; Crouch et 
a(., J. Mol. Appl. Genet. (1983) 2:273-283; and Simon et al., Plant Molecular Biology (1985) 5: 191-201, 
describe various aspects of Brassica napus storage proteins. Beachy et al., EMBO J. (1985) 4:3047-3053; 
Sengupta-Gopalan et al., Proc. Natl. Acad. ScL USA (1985) 82:3320-3324; Greenwood and Chrispeels, Plant 
Physiol. (1985) 79:65-71 and Chen et al., Proc. Natl. Acad. Sci. USA (1986) 83:8560-8564 describe studies 
40 concerned with seed storage proteins and genetic manipulation. Eckes et al„ Mol. Gen. Genet. (1986) 
205:14-22 and Ruhr et al., Science (1986) 232:1106-1112 describe the genetic manipulation of light 
inducible plant genes. 

Sengupta-Gopalan et al (1985), PNAS USA. 82, pp. 3320-3324, describes expression of an entire 
heterologous seed specific gene, phaseolin, in tobacco, under the regulatory control of the transcription 
45 initiation region which naturally provides for expression of that gene. 

EP-A-01 42924 (Agrigenetics Research Limited) describes a plasmid containing phaseolin 5' and 3' 
regulatory regions with a heterologous gene inserted between these regions. The aim and teaching 
presented by this document is to confer insects resistance by expressing the Bt protein in whole plants. 
Scofield et al (1985). J. Cell Biochem. Suppl. 9c, abstract number 1695, discloses the isolation of 
so genomic clones coding for napin. 

SUMMARY OF THE INVENTION 

DNA constructs are provided which are employed in manipulating plant cells to provide for seed- 
55 specific transcription. Particularly, storage protein transcriptional regions are joined to other than the wild- 
type gene and introduced into plant genomes to provide for seed-specific transcription. The constructs 
provide for modulation of endogenous products as'well as production of heterologous products. 
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DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

Novel DNA constructs are provided which allow for modification of transcription in seed, particularly in 
embryos during seed maturation. The DNA constructs comprise a regulated transcriptional initiation region 

s associated with seed formation, preferably in association with embryogenesis and seed maturation. Of 
particular interest are those transcriptional initiation regions associated with storage proteins, such as napin, 
cruciferin, 0-conglycinin, or the like. The transcriptional initiation regions may be obtained from any 
convenient host, particularly plant hosts such as Brassica , e.g. napus or campestris , soybean (Glycine 
max ), bean ( Phaseolus vulgaris ), corn ( Zea mays ), cotton ( Gossypium sp.), safflower (Carthamus tinctorius ), 

70 tomato ( Lycopersican esculentum ), and Cuphea species. 

Downstream from and under the transcriptional initiation regulation of the seed specific region will be a 
sequence of interest which will provide for modification of the phenotype of the seed, by modulating the 
production of an endogenous product, as to amount, relative distribution, or the like, or production of a 
heterologous expression product to provide for a novel function or product in the seed. The DNA construct 

75 will also provide for a termination region, so as to provide an expression cassette into which a gene may be 
introduced. Conveniently, transcriptional initiation and termination regions may be provided separated in the 
direction of transcription by a linker or polylinker having one or a plurality of restriction sites for insertion of 
the gene to be under the transcriptional regulation of the regulatory regions. Usually, the linker will have 
from 1 to 10, more usually from about 1 to 8, preferably from about 2 to 6 restriction sites. Generally, the 

20 linker will be fewer than 100 bp, frequently fewer than 60 bp and generally at least about 5 bp. 

The transcriptional initiation region may be native or homologous to the host or foreign or heterologous 
to the host. By foreign is intended that the transcriptional initiation region is not found in the wild-type host 
into which the transcriptional initiation region is introduced. 

Transcriptional initiation regions of particular interest are those associated with the Brassica napus or 

25 campestris napin genes, acyl carrier proteins, genes that express from about day 7 to day 40 in seed, 
particularly having maximum expression from about day 10 to about day 20, where the expressed gene is 
not found in leaves, while the expressed product is found in seed in high abundance. 

The transcriptional cassette will include in the 5'-3* direction of transcription, a transcriptional and 
translational initiation region, a sequence of interest, and a transcriptional and translational termination 

30 region functional in plants. One or more introns may also be present. The DNA sequence may have any 
open reading frame encoding a peptide of interest, e.g. an enzyme, or a sequence complementary to a 
genomic sequence, where the genomic sequence may be an open reading frame, an intron, a non-coding 
leader sequence, or any other sequence where the complementary sequence will inhibit transcription, 
messenger RNA processing, e.g. splicing, or translation. The DNA sequence of interest may be synthetic, 

35 naturally derived, or combinations thereof. Depending upon the nature of the DNA sequence of interest, it 
may be desirable to synthesize the sequence with plant preferred codons. The plant preferred codons may 
be determined from the codons of highest frequency in the proteins expressed in the largest amount in the 
particular plant species of interest. 

In preparing the transcription cassette, the various DNA fragments may be manipulated, so as to 

40 provide for the DNA sequences in the proper orientation and, as appropriate, in the proper reading frame. 
Toward this end, adapters or linkers may be employed for joining the DNA fragments or other manipulations 
may be involved to provide for convenient restriction sites, removal of superfluous DNA, removal of 
restriction sites, or the like. For this purpose, in vitro mutagenesis, primer repair, restriction, annealing, 
resection, ligation, or the like may be employed, where insertions, deletions or substitutions, e.g. transitions 

45 and transversions, may be involved. 

The termination region which is employed will be primarily one of convenience, since the termination 
regions appear to be relatively interchangeable. The termination region may be native with the transcrip- 
tional initiation region, may be native with the DNA sequence of interest, or may be derived from another 
source. Convenient termination regions are available from the Ti-plasmid of A. tumefaciens , such as the 

so octopine synthase and nopaline synthase termination regions. 

By appropriate manipulations, such as restriction, chewing back or filling in overhangs to provide blunt 
ends, ligation of linkers, or the like, complementary ends of the fragments can be provided for joining and 
ligation. 

In carrying out the various steps, cloning is employed, so as to amplify the amount of DNA and to allow 
55 for analyzing the DNA to ensure that the operations have occurred in a proper manner. A wide variety of 
cloning vectors are available, where the cloning vector includes a replication system functional in E. coli and 
a marker which allows for selection of the transformed cells. Illustrative vectors include pBR332, pUC series, 
M13mp series, pACYC184, etc. Thus, the sequence may be inserted into the vector at an appropriate 
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restriction site(s). the resulting plasmid used to transform the E. coli host the E. coli grown in an 
appropriate nutrient medium and the cells harvested and lysed and the plasmid recovered. Analysis may 
involve sequence analysis, restriction analysis, electrophoresis, or the like. After each manipulation the DNA 
sequence to be used in the final construct may be restricted and joined to the next sequ nee, where each 

5 of the partial constructs may be cloned in the same or different plasmids. 

In addition to the transcription construct, depending upon the manner of introduction of the transcription 
construct into the plant, other DNA sequences may be required. For example, when using the Ti- or Ri- 
plasmid for transformation of plant cells, as described below, at least the right border and frequently both 
the right a left borders of the T-DNA of the Ti-and Ri-plasmids will be joined as flanking regions to the 

w transcription construct. The use of T-DNA for transformation of plant cells has received extensive study and 
is amply described in EPA Serial No. 120,516, Hoekema, In: "The Binary Plant Vector System", Offset- 
drukkerij Kanters B.V., Amsterdm, 1985, Chapter V, Fraley, et aL, Crit. Rev. Plant Sci. , 4:1-46, and An et al„ 
EMBO J. (1985) 4:277-284. 

Alternatively, to enhance integration into the plant genome, terminal repeats of transposons may be 

75 used as borders in conjunction with a transposase. In this situation, expression of the transposase should be 
inducible, or the transposase inactivated, so that once the transcription construct is integrated into the 
genome, it should be relatively stably integrated and avoid hopping. 

The transcription construct will normally be joined to a marker for selection in plant cells. Conveniently, 
the marker may be resistance to a biocide, particularly an antibiotic, such as kanamycin, G418, bleomycin, 

20 hygromycin, chloramphenicol, or the like. The particular marker, employed will be one which will allow for 
selection of transformed cells as compared to cells lacking the DNA which has been introduced. 

A variety of techniques are available for the introduction of CNA into a plant cell host. These techniques 
include transformation with Ti-DNA employing A. tumefaciens or A. rhizogenes as the transforming agent, 
protoplast fusion, injection, electroporation, etc. For transformation with Agrobacterium , plasmids can be 

25 prepared in E. coli which plasmids contain DNA homologous with the Ti-plasmid ( particularly T-DNA. The 
plasmid may or may not be capable of replication in Agrobacterium , that is, it may or may not have a broad 
spectrum prokaryotic replication system, e.g. RK290, depending in part upon whether the transcription 
construct is to be integrated into the Ti-plasmid or be retained on an independent plasmid. By means of a 
helper plasmid, the transcription construct may be transferred to the A. tumefaciens and the resulting 

30 transformed organism used for transforming plant cells. 

Conveniently, explants may be cultivated with the A. tumefaciens or A. rhizogenes to allow for transfer 
of the transcription construct to the plant cells, the plant cells dispersed in an appropriate selective medium 
for selection, grown to callus, shoots grown and plantlets regenerated from the shoots by growing in rooting 
medium. The Agrobacterium host will contain a plasmid having the vir genes necessary for transfer of the 

35 T-DNA to the plant cells and may or may not have T-DNA. For injection and electroporation, disarmed Ti- 
plasmids (lacking the tumor genes, particularly the T-DNA region) may be introduced into the plant cell. 

The constructs may be used in a variety of ways. Particularly, the constructs may be used to modify 
the fatty acid composition in seeds, that is changing the ratio and/or amounts of the various fatty acids, as 
to length, unsaturation, or the like. Thus, the fatty acid composition may be varied, enhancing the fatty acids 

40 of from 10 to 14 carbon atoms as compared to the fatty acids of from 16 to 18 carbon atoms, increasing or 
decreasing fatty acids of from 20 to 24 carbon atoms, providing for an enhanced proportion of fatty acids 
which are saturated or unsaturated, or the like. These results can be achieved by providing for reduction of 
expression of one or more endogenous products, particularly enzymes or cofactors, by producing a 
transcription product which is complementary to the transcription product of a native gene, so as to inhibit 

45 the maturation and/or expression of the transcription product, or providing for expression of a gene, either 
endogenous or exogenous, associated with fatty acid synthesis. Expression products associated with fatty 
acid synthesis include acyl carrier protein, thioesterase, acetyl transacylase, acetyl-coA carboxylasem, 
ketoacyl-synthases, malonyl transacylase, stearoyl-ACP desaturase, and other desaturase enzymes. 

Alternatively, one may wish to provide various products from other sources including mammals, such as 

50 blood factors, lymphokines, colony stimulating factors, interferons, plasminogen activators, enzymes, e.g. 
superoxide dismutase, chymosin, etc., hormones, rat mammary thioesterase 2, phospholipid acyl de- 
saturases involved in the synthesis of cicosapentaenoic acid, human serum albumin. Another purpose is to 
increase the level of seed proteins, particularly mutated seed proteins, having an improved amino acid 
distribution which would be better suited to the nutrient value of the seed. In this situation, one might 

55 provide for inhibition of the native seed protein by producing a complementary DNA sequence to the native 
coding region or non-coding region, where the complementary sequence would not efficiently hybridize to 
the mutated sequence, or inactivate the native transcriptional capability. 
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The cells which have been transformed may be grown into plants in accordance with conventional 
ways. See, for example, McCormick et al., Plant Cell Reports (1986) 5:81-84. These plants may then be 
grown, and either pollinated with the same transformed strain or different strains, identifying the resulting 
hybrid having the desired phenotypic characteristic. Two or more generations may be grown to ensure that 
5 the subject phenotypic characteristic is stably maintained and inherited and then seeds harvested to ensure 
the desired phenotype or other property has been achieved. 

As a host cell, any plant variety may be employed which provides a seed of interest. Thus, for the most 
part, plants will be chosen where the seed is produced in high amounts or a seed specific product of 
interest is involved. Seeds of interest include the oil seeds, such as the Brassica seeds, cotton seeds, 
70 soybean, safflower, sunflower, or the like; grain seeds, e.g. wheat, barley, rice, clover, corn, or the like. 

Identifying useful transcriptional initiation regions may be achieved in a number of ways. Where the 
seed protein has been or is isolated, it may be partially sequenced, so that a probe may be designed for 
identifying messenger RNA specific for seed. To further enhance the concentration of the messenger RNA 
specifically associated with seed, cDNA may be prepared and the cDNA subtracted with messenger RNA or 
75 cDNA from non-seed associated cells. The residual cDNA may then be used for probing the genome for 
complementary sequences, using an appropriate library prepared from plant cells. Sequences which 
hybridize to the cDNA may then be isolated, manipulated, and the 5'-untranslated region associated with 
the coding region isolated and used in expression constructs to identify the transcriptional activity of the 5'- 
untranslated region. 

20 In some instances, the research effort may be further shortened by employing a probe directly for 
screening a genomic library and identifying sequences which hybridize to the probe. The sequences will be 
manipulated as described above to identify s'-untranslated region. 

The expression constructs which are prepared employing the 5'-untranslated regions may be trans- 
formed into plant cells as described previously for determination of their ability to function with a 

25 heterologous structural gene (other than the wild-type open reading frame associated with the 5'- 
untranslated region) and the seed specificity. In this manner, specific sequences may be identified for use 
with sequences for seed specific transcription. Expression cassettes of particular interest include transcrip- 
tional initiation regions from napin genes, particularly Brassica napin genes, more particularly Brassica 
napus or Brassica campestris genes, regulating structural genes associated with lipid production, particu- 

30 larly fatty acid production, including acyl carrier proteins, which may be endogenous or exogenous to the 
particular plant, such as spinach acyl carrier protein, Brassica acyl carrier protein, acyl carrier protein, either 
napus or campestris , Cuphea acyl carrier protein,, acetyl transacylase, malonyl transacylase, 0-ketoacyl 
synthases I and II, thioesterase, particularly thioesterase II, from plant, mammalian, or bacterial sources, for 
example rat thioesterase II, acyl ACP, or phospholipid acyl desaturases. 

35 The following examples are offered by way of illustration and not by way of limitation. 

EXPERIMENTAL 
Materials and Methods 

40 

Cloning Vectors 

Cloning vectors used include the pUC vectors, pUC8 and pUC9 (Vieira and Messing, Gene (1982) 
19:259-268); pUC18 and pUCl9 (Norrander et al., Gene (1983) 26:101-106; Yanisch-Perron et al., Gene - 

45 (1985) 33:103-119), and analogous vectors exchanging chloramphenicol resistance (CAM) as a marker for 
the ampicillin resistance of the pUC plasmids described above (pUC-CAM [pUC12-Cm, pUC13-Cm] 
Buckley, D., Ph.D. Thesis, U.C.S.D., CA 1985). The multiple cloning sites of PUC18 and pUC19 vectors 
were exchanged with those of pUC-CAM to create pCGN565 and pCGN566 which are CAM resistant. Also 
used were pUCH8 and pUCll9, which are respectively, pUC18 and pUC19 with the intergenic region of 

50 M13, from an HgiAl site at 5465 to the Ahalll site at 5941, inserted at the Nde l site of pUC. (Available from 
Vieira J. and Messing, J. Waksman Institute, Rutgers University, Rutgers, N.J.) 

Materials 

55 Terminal deoxynucleotide transferase (TDT); >tt RNaseH, E. coli DNA polymerase, T4 kinase, and 
restriction enzymes were obtained from Bethesda Research Laboratories; E. coli DNA ligase was obtained 
from New England Biolabs; reverse transcriptase was obtained from Life Sciences, Inc.; isotopes were 
obtained from Amersham; X-gal was obtained from Bachem, Inc. Torrance, CA. 
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Example I 

Construction of a Napin Promoter 

5 There are 298 nuclotides upstream of the ATG start codon of the napin gene on the pgN1 clone, a 3.3 
kb EcoRI fragment of B. napus genomic DNA containing a napin gene cloned into pUC8 (available from 
Marti Crouch, University of Indiana). pgN1 DNA was digested with EcoRI and Sstl and ligated to EcoRI/Sstl 
digested pCGN706. (pCGN706 is an Xho l/Pstl fragment containing 3' and polyadenylation sequences of 
another napin cDNA clone pN2 (Crouch et al, 1983 supra ) cloned in pCGN566 at the Sail and Pstl sites.) 

70 The resulting clone pCGN707 was digested with Sail and treated with the enzyme Bal31 to remove some of 
the coding region of the napin gene. The resulting resected DNA was digested with Smal after the Bal31 
treatment and religated. One of the clones, pCGN713, selected by size, was subcloned by Eco RI and 
Bam HI digestion into both EcoRI /Bam HI digested pEMBL18 (Dente et al., Nucleic Acids Res. (1983) 
;H:1645-1655) and pUC118 to give E418 and E4118 respectively. The extent of Bal31 digestion was 

rs conformed by Sanger dideoxy sequencing of E418 template. The Bal31 deletion of the promoter region 
extended only to 57 nucleotides downstream of the start codon, thus containing the 5' end of the napin 
coding sequence and about 300 bp of the 5' non-coding region. E4118 was tailored to delete all of the 
coding region of napin including the ATG start codon by in vitro mutagenesis by the method of Zoller and 
Smith (Nucleic Acids Res. (1982) 10:6487-6500) using an oligonucleotide primer 5'-GATGTTTTGTATGTG- 

20 GGCCCCTAGGAG ATC-3' .Screening for the appropriate mutant was done by two transformations into E. 
coli strain JM83 (Messing J. t In: Recombinant DNA Technical Bulletin, NIH Publication No. 79-99, 2 No. 2, 
1979, pp 43-48) and Sma l digestion of putative transformants. The resulting napin promoter clone is 
pCGN778 and contains 298 nucleotides from the EcoRI site of pgN1 to the A nucleotide just before the 
ATG start codon of napin. The promoter region was subcloned into a chloramphenicol resistant background 

25 by digestion with EcoRI and Bam HI and ligation to EcoRI /Bam HI digested pCGN565 to give pCGN779c. 

Extension of the Napin Promoter Clone 

pCGN779c contains only 298 nucleotides of potential 5'-regulatory sequence. The napin promoter was 
30 extended with a 1.8 kb fragment found upstream of the 5' -EcoRI site on the original XBnNa clone. The -3.5 
kb Xhol fragment of XBnNa (available from M. Crouch), which includes the napin region, was subcloned into 
Sail-digested pUC119 to give pCGN930. A Hindlll site close to a 5' Xhol site was used to subclone the 
HindlH/EcoRI fragment of pCGN930 into Hindlll/EcoRI digested Bluescript + (Vector Cloning Systems, San 
Diego, CA) to give pCGN942. An extended napin promoter was made by ligating pCGN779c digested with 
35 EcoRI and Pstl and pCGN942 digested with EcoRI and Pstl to make pCGN943. This promoter contains -2.1 
kb of sequence upstream of the original ATG of the napin gene contained on XBnNa. A partial sequence of 
the promoter region is shown in Figure 1. 

Napin Cassettes 

40 

The extended napin promoter and a napin 3' -regulatory region is combined to make a napin cassette 
for expressing genes seed-specifically. The napin 3-region used is from the plasmid pCGN1924 containing 
the Xhol/EcoRI fragment from pgN1 (Xho l site is located 18 nucleotides from the stop codon of the napin 
gene) subcloned into EcoRI/Sall digested pCGN565. Hindlll/Pstl digested pCGN943 and pCGN1924 are 
45 ligated to make the napin cassette pCGN744 t with unique cloning sites Sma l, Sai l, and Pstl for inserting 
genes. 

Construction of cDNA Library from Spinach Leaves 

so Total RNA was extracted from young spinach leaves in 4M guanidine thiocyanate buffer as described 
by Facciotti et aj. (Biotechnology (1985) 3:241-246). Total RNA was subjected to oligo(dT)-cellulose column 
chromatography two times to yield poly(A) + RNA as described by Maniatis et aj., (1982) Molecular Cloning: 
A Laboratory Manual , Cold Spring Harbor Laboratory, New York. A cDNA library was constructed in pUC13- 
Cm aaccording to the method of Gubler and Hoffman, (Gene (1983) 25:263-269) with slight modifications. 

55 RNasin was omitted in the synthesis of first strand cDNA as it interfered with second strand synthesis if not 
completely removed, and dCTP was used, to tail the vector DNA and dGTP to tail double-stranded cDNA 
instead of the reverse as described in the paper. The annealed cDNA was transformed to competent E. coli 
JM83 (Messing (1979) supra ) cells according to Hanahan (J. Mol. Biol. (1983) 166:557-580) and spread onto 
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LB agar plates (Miller (1972) Experiments in Molecular Genetics. Cold Spring Harbor Laboratory, Cold 
Spring Harbor, New York) containing -50 ug/ml chloramphenicol and 0.005% X-Gal. 



Identification of Spinach ACP-I cDNA 



A total of approximately 8000 cDNA clones were screened by performing Southern blots (Southern, J. 
Mol. Biol. (1975) 98:503) and dot blot (described below) hybridizations with clone analysis DNA from 40 
pools representing 200 cDNA clones each (see below). A 5' end labeled synthetic oligonucleotide (ACPP4) 
that is at least 66% homologous with a 16 amino acid region of spinach ACP-I (5'-GATGTCTTGAGCCTTG- 
TCCTCATCCACATTGATACCAAACTCCTCCTC-3) is the complement to a DNA sequence that could 
encode the 16 amino acid peptide glu-glu-glu-phe-gly-ile-asn-val-asp-glu-asp-lys-ala-gln-asp-ile, residues 
49-64 of spinach ACP-I (Kuo and Ohlrogge, Arch. Biochem. Biophys. (1984) 234:290-296) and was used for 
an ACP probe. 

Clone analysis DNA for Southern and dot blot hybridizations was prepared as follows. Transformants 
were transferred from agar plates to LB containing 50 ug/ml chloramphenicol in groups of ten clones per 10 
ml media. Cultures were incubated overnight in a 37 *C shaking incubator and then diluted with an equal 
volume of media and allowed to grow for 5 more hours. Pools of 200 cDNA clones each were obtained by 
mixing contents of 20 samples. DNA was extracted from these cells as described by Birnboim and Doly 
(Nucleic Acids Res. (1979) 7:1513-1523). DNA was purified to enable digestion with restriction enzymes by 
extractions with phenol and chloroform followed by ethanol precipitation. DNA was resuspended in sterile, 
distilled water and 1 ug of each of the 40 pooled DNA samples was digested with EcoRI and Hindlll and 
electrophoresed through 0.7% agarose gels. DNA was transferred to nitrocellulose filters following the blot 
hybridization technique of Southern. 

ACPP4 was 5' end-labeled using y- 32 P dATP and T4 kinase according to the manufacturer's specifica- 
tions. Nitrocellulose filters from Southern blot transfer of clone analysis DNA were hybridized (24 hours, 
42 *C) and washed according to Berent et al. (BioTechniques (1985) 3:208-220). Dot blots of the same set 
of DNA pools were prepared by applying 1 ug of each DNA pool to nylon membrane filters in 0.5 M NaOH. 
These blots were hybridized with the probe for 24 hours at 42 *C in 50% formamide/1% SDS/1 M NaCL, 
and washed at room temperature in 2X SSC/0.1% SDS (1X SSC = 0.15M NaCI; 0.015M Na citrate; SDS- 
sodium dodecylsulfate). DNA from the pool which was hybridized by the ACPP4 oligoprobe was trans- 
formed to JM83 cells and plated as above to yield individual transformants. Dot blots of these individual 
cDNA clones were prepared by applying DNA to nitrocellulose filters which were hybridized with the ACPP4 
oligonucleotide probe and analyzed using the same conditions as for the Southern blots of pooled DNA 
samples. 



Nucleotide Sequence Analysis 



The positive clone, pCGNlSOL, was analyzed by digestion with restriction enzymes and the following 
partial map was obtained. 



pUC13-Cm | —35— | 248 | -63 - 1 1 52 | -200 



iii ii ii., 

H H N P Xh E SXBSmSaE 

H -Hln dlll N-Ncol P -PvuI I Xh-Xhol 

E -EcoRI S-Sall X- Xba l Sm- Sma l 

-former PstI s 
with tailing 



B-BamHI Ss-SstI "-former PstI site destroyed 



**polylinker with avaiTable restriction sites indicated 
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The cDNA clone was subcloned into pUC118 and pUCl19 using standard laboratory techniques of 
restriction, ligation, transformation, and analysis (Maniatis et al., (1982) supra ). Single-stranded DNA 
template was prepared and DNA sequence was determined using the Sanger dideoxy technique (Sanger et 
al., (1977) Proc. Nat. Acad. Sci. USA 74:5463-5467). Sequence analysis was performed using a software 
5 package from Intelli-Genetics, Inc. 

pCGNlSOL contains an (approximately) 700 bp cDNA insert including a stretch of A residues at the 3' 
terminus which represents the poly(A) tail of the mRNA. An ATG codon at position 61 is presumed to 
encode the MET translation initiation codon. This codon is the start of a 411 nucleotide open reading frame, 
of which, nucleotides 229-471 encode a protein whose amino acid sequence corresponds almost perfectly 
10 with the published amino acid sequence of ACP-I of Kuo and Ohlrogge supra as described previously. In 
addition to mature protein, the pCGNlSOL also encodes a 56 residue transit peptide sequence, as might be 
expected for a nuclear-encoded chloroplast protein. 

Napin - ACP Construct 

T5 

pCGN796 was constructed by ligating pCGNlSOL digested with Hindlll /Bam HI, pUC8 digested with 
Hindlll and Bam HI and pUC118 digested with Bam HI. The ACP gene from pCGN796 was transferred into a 
chloramphenicol background by digestion with Bam HI and ligation with Bam HI digested pCGN565. The 
resulting pCGN1902 was digested with EcoRI and Sma l and ligated to EcoRI /Sma l digested pUCH8 to give 
20 PCGN1920. The ACP gene in pCGNl920 was digested at the Ncol site, filled in by treatment with the 
Klenow fragment, digested with Sma l and religated to form pCGN1919. This eliminated the 5' -coding 
sequences from the ACP gene and regenerated the ATG. This ACP gene was flanked with Pstl sites by 
digesting pCGN1919 with EcoRI, filling in the site with the Klenow fragment and ligating a Pstl linker. This 
clone is called pCGN945. 

25 The ACP gene of pCGN945 was moved as a Bam HI/Pstl fragment to pUC118 digested with Bam HI and 
Pst l to create pCGN945a so that a Sma l site (provided by the pUCH8) would be at the 5' -end of the ACP 
sequences to facilitate cloning into the napin cassette pCGN944. pCGN945a digested with Sma l and Pstl 
was ligated to pCGN944 digested with Sma l and Pstl to produce the napin ACP cassette pCGN946. The 
napin ACP cassette was then transferred into the binary vector pCGN783 by cloning from the Hindlll site to 

30 produce pCGN948. 

Construction of the Binary Vector pCGN783 

pCGN783 is a binary plasmid containing the left and right T-DNA borders of A. tumefaciens (Barker et 
35 al., Plant Mol. Biol. (1983) 2:335-350); the gentamicin resistance gene of pPHUI (Hirsch et al., Plasmid - 
(1984), 12:139-141) the 35S promoter of cauliflower mosaic virus (CaMV) (Gardner et al., Nucleic Acids 
Res. (1981) 9:2871-2890), the kanamycin resistance gene of Tn5 (Jorgenson et a}., infra and Wolff et al., 
ibid (1985) 13:355-367) and the 3' region from transcript 7 of pTiA6 (Barker et al., supra (1983)). 

To obtain the gentamicin resistance marker, the gentamicin resistance gene was isolated from a 3.1 kb 
40 EcoRI-Pstl fragment of pPHIJI and cloned into pUC9 yielding pCGN549. The Hindlll-BamHI fragment 
containing the gentamicin resistance gene was substituted for the Hindlll-Bglll fragment of pCGN587 
creating pCGN594. 

pCGN587 was prepared as follows: The Hindlll- Sma l fragment of Tn5 containing the entire structural 
gene for APH II (Jorgenson et al., Mol. Gen. Genet. (1979) 177:65) was cloned into pUC8 (Vieira and 

45 Messing, Gene (1982) 19:259), converting the fragment into a Hindlll-EcoRl fragment, since there is an 
EcoRI site immediately adjacent to the Sma l site. The Pstl-EcoRI fragment containing the 3'-portion of the 
APH II gene was then combined with an EcoRI- Bam Hl-Sall-Pstl linker into the EcoRI site of pUC7 
(pCGN546W). Since this construct does not confer kanamycin resistance, kanamycin resistance was 
obtained by inserting the Bglll-Pstl fragment of the APH II gene into the Bam HI-Pstl site (pCGN546X). This 

so procedure reassembles the APH II gene, so that Eco sites flank the gene. An ATG codon was upstream from 
and out of reading frame with the ATG initiation codon of APH II. The undesired ATG was avoided by 
inserting a Sau3A-Pstl fragment from the 5' -end of APH II, which fragment lacks the superfluous ATG, into 
the Bam HI-Pstl site of pCGN546W to provide plasmid pCGN550. 

The EcoRI fragment containing the APH II gene was then cloned into the unique EcoRI site of pCGN451, 

55 which contains an octopine synthase cassette for expression, to provide pCGN552 (1ATG). 

pCGN451 includes an octopine cassette which contains about 1556 bp of the- 5' non-coding region 
fused via an 'EcoRI linker to the 3' non-coding region of the octopine synthase gene of pTiA6. The pTi 
coordinates are 11,207 to 12,823 for the 3' region and 13,643 to 15,208 for the 5' region as defined by 
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Barker et al., Plant Mol. Biol. (1983) 2:325. 

The S 7 " fragment was obtained as follows. A small subcloned fragment containing the 5' end of the 
coding region, as a Bam HI-EcoRI fragment was cloned in pBR322 as plasmid pCGN407. The BamHI-EcoRI 
fragment has an Xmn l site in the coding region, while pBR322 has two Xmnl sites. pCGN407 was digested 

s with Xmn l, resected with Bal31 nuclease and EcoRI linkers added to the fragments. Aft r EcoRI and Bam HI 
digestion, the fragments were size fractionated, the fractions cloned and sequenced. In one case, the entire 
coding region and 10 bp of the 5' non-translated sequences had been removed leaving the 5' non- 
translated region, the mRNA cap site and 16 bp of the 5' non-translated region (to a Bam HI site) intact. This 
small fragment was obtained by size fractionation on a 7% acrylamide gel and fragments approximately 130 

to bp long eluted. 

This size fractionated DNA was ligated into M13mp9 and several clones sequenced and the sequence 
compared to the known sequence of the octopine synthase gene. The M13 construct was designated p14, 
which plasmid was digested with Bam HI and EcoRI to provide the small fragment which was ligated to a 
Xhol to Bam HI fragment containing upstream 5' sequences from pTiA6 (Garfinkel and Nester, J. Bacteriol . 

;s (1980) 144:732) and to an EcoRI to Xho l fragment containing the 3' sequences. 

The resulting Xhol fragment was cloned into the Xhol site of a pUC8 derivative, designated pCGN426. 
This plasmid differs from pUC8 by having the sole EcoRI site filled in with DNA polymerase I, and having 
lost the Pstl and Hindlll site by nuclease contamination of Hindi restriction endonuclease, when a Xho l 
linker was inserted into the unique Hindi site of pUC8. The resulting plasmid pCGN451 has a single EcoRI 

20 site for the insertion of protein coding sequences between the 5' non-coding region (which contains 1 ,550 
bp of 5' non-transcribed sequence including the right border of the T-DNA, the mRNA cap site and 16 bp of 
5' non-translated sequence) and the 3' region (which contains 267 bp of the coding region, the stop codon, 
196 bp of 3' non-translated DNA, the poIyA site and 1,153 bp of 3' non-transcribed sequence). pCGN451 
also provides the right T-DNA border. 

25 The resulting plasmid pCGN451 having the ocs 5' and the ocs 3' in the proper orientation was digested 
with EcoRI and the Eco RI fragment from pCGN551 containing the intact kanamycin resistance gene 
inserted into the Eco RI site to provide pCGN552 having the kanamycin resistance gene in the proper 
orientation. 

This ocs/KAN gene was used to provide a selectable marker for the trans type binary vector pCGN587. 

30 The 5' portion of the engineered octopine synthase promoter cassette consists of pTiA6 DNA from the 
Xho l at bp 15208-13644 (Barker's numbering), which also contains the T-DNA boundary sequence (border) 
implicated in T-DNA transfer. In the plasmid pCGN587, the ocs/KAN gene from pCGN552 provides a 
selectable marker as well as the right border. The left boundary region was first cloned in M13mp9 as a 
Hindlll -Sma l piece (pCGN502) (base pairs 602-2213) and recloned as a Kpnl-EcoRI fragment in pCGN565 

35 to provide pCGN580. pCGN565 is a cloning vector based on pUC8-Cm, but containing pUC18 linkers. 
pCGN580 was linearized with Bam HI and used to replace the smaller Bglll fragment of pVCK102 (Knauf 
and Nester, Plasmid (1982) 8:45), creating pCGN585. By replacing the smaller Sail fragment of pCGN585 
with the Xho l fragment from pCGN552 containing the ocs/KAN gene, pCGN587 was obtained. 

The pCGN594 Hindlll- Bam HI region, which contains an 5'-ocs-kanamycin-ocs-3' (ocs is octopine 

40 synthase with 5' designating the promoter region and 3' the terminator region, see U.S. application serial 
no. 775,923, filed September 13, 1985) fragment was replaced with the Hindlll- Bam HI poly linker region from 
PUC18. 

pCGN566 contains the EcoRI-Hindlll linker of pUC18 inserted into the EcoRI-Hindlll sites of pUC13-Cm. 

The Hindlll-Bglll fragment of pNW3lC-8,29-1 (Thomashow et al., Cell (1980) 19:729) containing ORF1 and 
45 -2 of pTiA6 was subcloned into the Hindlll -Bam HI sites of pCGN566 producing pCGN703. 

The Sau3A fragment of pCGN703 containing the 3' region of transcript 7 (corresponding to bases 2396- 

2920 of pTiA6 (Barker et al.. (1983) supra ) was sub-cloned into the Bam HI site of pUCl8 producing 

pCGN709. The EcoRI -Sma l polylinker region of pCGN709 was substituted with the EcoRI -Sma l fragment of 

pCGN587, which contains the kanamycin resistance gene (APH 3-II) producing pCGN726. 
so The EcoRI-Sall fragment of pCGN726 plus the Bglll-EcoRI fragment of pCGN734 were inserted into the 

Bam HI-Sall site of pUC8-Cm producing pCGN738. pCGN726c is derived from pCGN738 by deleting the 

900 bp EcoRI-EcoRI fragment. 

To construct pCGN167, the Alul fragment of CaMV (bp 7144-7735) (Gardner et al., Nucl. Acid Res. - 

(1981) 9:2871-2888) was obtained by digestion with Alul and cloned into the Hindi site of M13mp7 
55 (Messing et al .. Nucl. Acids Res. (1981) 9:309-321) to create C614. An Eco RI digest of C61 4 produced the 

EcoRI fragment from C614 containing the 35S promoter which was cloned into the EcoRI site-of pUC8 

(Vieira and Messing, Gene (1982) 19:259) to produce pCGN146. 
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To trim the promoter region, the Bglll site (bp 7670) was treated with Bglll and resected with Bal31 and 
subsequently a Bglll linker was attached to the Bal31 treated DNA to produce pCGN147. 

pCGN148a containing a promoter region, selectable marker (KAN with 2 ATG's) and 3' region, was 
prepared by digesting pCGN528 with Bglll and inserting the Bam HI-Bglll promoter fragment from 
s pCGN147. This fragment was cloned into the Bglll site of pCGN528 so that the Bglll site was proximal to 
the kanamycin gene of pCGN528. 

The shuttle vector used for this construct, pCGN528, was made as follows. pCGN525 was made by 
digesting a plasmid containing Tn5 which harbors a kanamycin gene (Jorgenson et aL, Mol. Gen. Genet. 
1979) 177:65) with Hindlll- Bam HI and inserting the Hindlll-BamHI fragment containing the kanamycin gene 
70 into the Hindlll- Bam HI sites in the tetracycline gene of pACYC184 (Chang and Cohen, J. Bacteriol. (1978) 
134:1141-1156). pCGN526 was made by inserting the Bam HI fragment 19 of pTiA6 (Thomashow et al„ Cell 
(1980) 19:729-739), modified with Xhol linkers inserted into the Smal site, into the Bam HI site of pCGN525. 
pCGN528 was obtained by deleting the small Xhol fragment from pCGN526 by digesting with Xhol and 
religating. 

f5 pCGN149a was made by cloning the Bam HI-kanamycin gene fragment from pMB9KanXXI into the 
Bam HI site of pCGN148a. 

pMB9KanXXI is a pUC4K variant (Vieira and Messing, Gene (1982) 19:259-268) which has the Xhol site 
missing but contains a functional kanamycin gene from Tn903 to allow for efficient selection in Agrobac- 
terium . 

20 pCGNl49a was digested with Bglll and Sph l. This small Bglll-Sphl fragment of pCGNl49a was 
replaced with the Bam HI- Sph l fragment from Ml (see below) isolated by digestion with Bam HI and Sph l. 
This produces pCGN167, a construct containing a full length CaMV promoter, 1ATG-kanamycin gene, 3' 
end and the bacterial Tn903-type kanamycin gene. Ml is an EcoRI fragment from pCGN546X (see 
construction of pCGN587) and was cloned into the EcoRI cloning site of M13mp9 in such a way that the 

25 Pstl site in the 1ATG-kanamycin gene was proximal to the polylinker region of M13mp9. 

The Hindlll- Bam HI fragment in the pCGN167 containing the CaMV-35S promoter, 1ATG-kanamycin 
gene and the Bam HI-fragment 19 of pTiA6 was cloned into the Bam HI-Hindlll sites of pUCl9 creating 
pCGN976. The 35S promoter and 3' region from transcript 7 was developed by inserting a 0.7 kb Hindlll- 
EcoRI fragment of pCGN976 (35S promoter) and the 0.5 kb EcoRI-Saul fragment of pCGN709 (transcript 

30 7*3*) into the Hindlll-Sall sites of pCGN566 creating pCGN766c. 

The 0.7 kb Hindlll-EcoRI fragment of pCGN766c (CaMV-35S promoter) was ligated to the 1.5 kb EcoRI- 
Sall fragment in pCGN726c (1ATG-KAN 3' region) followed by insertion into the Hindlll-Sall sites of pUCl19 
to produce pCGN778. The 2.2 kb region of pCGN778, Hindlll-Sall fragment containing the CaMV-35S 
promoter and 1ATG-KAN-3' region was used to replace the Hindlll-Sall linker region of pCGN739 to 

35 produce pCGN783. 

Transfer of the Binary Vector pCGN948 into Agrobacterium 

pCGN948 was introduced into Agrobacterium tumefaciens EHA101 (Hood et al., J. Bacteriol . (1986) 
40 168:1291-1301) by transformation. An overnight 2 ml culture of EHA101 was grown in MG/L broth at 30 # C. 
0.5 ml was inoculated into 100 ml of MG/L broth (Garfinkel and Nester, J. Bacteriol . (1980) 144:732-743) 
and grown in a shaking incubator for 5 h at 30 *C. The cells were pelleted by centrifugation at 7K, 
resuspended in 1 ml of MG/L broth and placed on ice. Approximately, 1 ug of pCGN948 DNA was placed 
in 100 ul of MG/L broth to which 200 ul of the EHA101 suspension was added; the tube containing the 
45 DNA-cell mix was immediately placed into a dry ice/ethanol bath for 5 minutes. The tube was quick thawed 
by 5 minutes in 37 *C water bath followed by 2 h of shaking at 30 - C after adding 1 ml of fresh MG/L 
medium. The cells were pelleted and spread onto MG/L plates (1.5% agar) containing 100 mg/l gentamicin. 
Plasmid DNA was isolated from individual gentamicin-resistant colonies, transformed back into E. coli, and 
characterized by restriction enzyme analysis to verify that the gentamicin-resistant EHA101 contained intact 
so copies of pCGN948. Single colonies are picked and purified by two more streakings on MG/L plates 
containing 100 mg/l gentamicin. 

Transformation and Regeneration of B. Napus 

55 Seeds of Brassica napus cv Westar were soaked in 95% ethanol for 4 minutes. They were sterilized in 
1% solution of sodium hypochlorite with 50 ul of "Tween 20" surfactant per 100 ml sterilent solution. After 
"soaking for 45 minutes, seeds were rinsed 4 times with sterile distilled water. They were planted in sterile 
plastic boxes 7 cm wide, 7 cm long, and 10 cm high (Magenta) containing 50 ml of 1/1 0th concentration of 
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MS (Murashige minimal organics medium, Gibco) with added pyridoxine (500 ug/1), nicotinic acid (50 ug/l), 
glycine (200 ug/1) and solidified with 0.6% agar. The seeds germinated and were grown at 22* C in a 16h- 
8h light-dark cycle with light intensity approximately 65 uEm~ 2 s~ 1 . After 5 days the seedlings were taken 
under sterile conditions and the hypocotyls excised and cut into pieces of about 4 mm in length. The 

s hypocotyl segments were placed on a feeder plate or without the feeder layer on top of a filter paper on the 
soldified B5 0/1/1 or B5 0/1/0 medium. B5 0/1/0 medium contains B5 salts and vitamins (Gamborg, Miller 
and Ojima, Experimental Cell Res. (1968) 50:151-158), 3% sucrose, 2,4-dichlorophenoxyacetic acid (1.0 
mg/l), pH adjusted to 5.8, and the medium is solidifed with 0.6% Phytagar; B5 0/1/1 is the same with the 
addition of 1.0 mg/l kinetin. Feeder plates were prepared 24 hours in advance by pipetting 1.0 ml of a 

70 stationary phase tobacco suspension culture (maintained as described in Fillatti et al., Molecular General 
Genetics (1987) 206:192-199) onto B5 0/1/0 or B5 0/1/1 medium. Hypocotyl segments were cut and placed 
on feeder plates 24 hours prior to Agrobacterium treatment. 

Agrobacterium tumefaciens (strain EHA101 x 948) was prepared by incubating a single colony of 
Agrobacterium in MG/L broth at 30 9 C. Bacteria were harvested 16 hours later and dilutions of 10 8 bacteria 

75 per ml were prepared in MG/L broth. Hypocotyl segments were inoculated with bacteria by placing the 
segments in an Agrobacterium suspension and allowing them to sit for 30-60 minutes, then removing and 
transferring to Petri plates containing B5 0/1/1 or 0/1/0 medium (0/1/1 intends 1 mg/1 2,4-D and 1 mg/l 
kinetin and 0/1/0 intends no kinetin). The plates were incubated in low light at 22 # C. The co-incubation of 
bacteria with the hypocotyl segments took place for 24-48 hours. The hypocotyl segments were removed 

20 and placed on B5 0/1/1 or 0/1/0 containing 500 mg/l carbenicillin (kanamycin sulfate at 10. 25, or 50 mg/l 
was sometimes added at this time) for 7 days in continuous light (approximately 65 uEm~ 2 S~ 1 ) at 22 *C. 
The segments were transferred to B5 salts medium containing 1% sucrose, 3 mg/l benzylamino purine 
(BAP) and 1 mg/l zeatin. This was supplemented with 500 mg/l carbenicillin, 10, 25, or 50 mg/l kanamycin 
sulfate, and solidified with 0.6% Phytagar (Gibco). Thereafter, explants were transferred to fresh medium 

25 every two weeks. 

After one month green shoots developed from green calli which were selected on media containing 
kanamyicin. Shoots continued to develop for three months. The shoots were cut from the calli when they 
were at least 1 cm high and placed on B5 medium with 1 % sucrose, no added growth substances, 300 mg/I 
carbencillin , and solidified with 0.6% phytagar. The shoots continued to grow and several leaves were 

30 removed to test for neomycin phosphotransferase II (NPTII) activity. Shoots which were positive for NPTll 
activity were placed in Magenta boxes containing B5 0/1/1 medium with 1% sucrose, 2 mg/l indolebutyric 
acid, 200 mg/l carbenicillin, and solidified with 0.6% Phytagar. After a few weeks the shoots developed 
roots and were transferred to soil. The plants were grown in a growth chamber at 22* C in a 16-8 hours 
light-dark cycle with light intensity 220 uEm~ 2 s -1 and after several weeks were transferred to the 

35 greenhouse. 

Southern Data 

Regenerated B. napus plants from cocultivations of Agrobacterium tumefaciens EHA101 containing 

40 pCGN948 and B. napus hypocotyls were examined for proper integration and embryo-specific expression of 
the spinach leaf ACP gene. Southern analysis was performed using DNA isolated from leaves of 
regenerated plants by the method of Dellaporta et al., (Plant Mol. Biol. Rep. (1983) 1^:19-21) and purified 
once by banding in CsCI. DNA (10 ug) was digested with the restriction enzyme EcoRI, electrophoresed on 
a 0.7% agarose gel and blotted to nitrocellulose (see Maniatis et al., (1982) supra .). Blots were probed with 

45 pCGN945 DNA containing 1.8 kb of the spinach ACP sequence or with the EcoRI/ Hindlll fragment isolated 
from pCGN936c (made by transferring the Hindlll/EcoRI fragment of pCGN930 into pCGN566) containing 
the napin 5' sequences labeled with 32 P-dCTP by nick translation (described by the manufacturer, BRL Nick 
Translation Reagent Kit, Bethesda Research Laboratories, Gaithersburg, MD). Blots were prehybridized and 
hybridized in 50% formamide, 10x Denhardt's 5xSSC, 0.1% SDS, 5 mM EDTA, 100 ug/ml calf thymus DNA 

so and 10% dextran sulfate (hybridization only) at 42 *C. (Reagents described in Maniatis et al.. (1982) supra .) 
Washes were in 1xSSC, 0.1% SDS, 30 min and twice in O.lxSSC, 0.1% SDS at 55 *C. 

Autoradiograms showed two bands of approximately 3.3 and 3.2 kb hybridized in the EcoRI digests of 
DNA from four plants when probed with the ACP gene (pCGN945) indicating proper integration of the 
spinach leaf ACP construct in the plant genome since 3.3 and 3.2 kb EcoRI fragments are present in the T- 

55 DNA region of pCGN948. The gene construct was present in single or multiple loci in the different plants as 
judged by the number of plant DNA-construct DNA border fragments detected when probed with the napin 
5' sequences. 
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Northern Data 

Expression of the integrated spinach leaf ACP gene from the napin promoter was detected by Northern 
analysis in seeds but not leaves of one of the transformed plants shown to contain the construct DNA. 
Developing seeds w re collected from the transformed plant 21 days post-anthesis. Embryos were 
dissected from the seeds and frozen in liquid nitrogen. Total RNA was isolated from the seed embryos and 
from leaves of the transformed plant by the method of Crouch et al ., 1983, supra , electrophoresed on 
formaldehyde-containing 1.5% agarose gels as described (Shewmaker et al ., Virology (1985) 140:281-288) 
and blotted to nitrocellulose (Thomas, Proc. Natl. Acad. Sci. USA (1980) 77:5201-5205). Blots were 
prehybridized, hybridized, and washed as described above. The probe was an isolated Pst l /Bam HI fragment 
from pCGN945 containing only spinach leaf ACP sequences labeled by nick translation. 

An RNA band of -0.8 kb was detected in embryos but not leaves of the transformed plant indicating 
seed-specific expression of the spinach leaf ACP gene. 

rs Example II 

Construction of B. Campestris Napin Promoter Cassette 

A Bglll partial genomic library of B. campestris DNA was made in the lambda vector Charon 35 using 
20 established protocols (Maniatis et al., 1982, supra ). The titer, of the amplified library was -1.2 x 10 9 
phage/ml. Four hundred thousand recombinant bacteriophage were plated at a density of 10 s per 9 x 9 in. 
NZY plate (NZYM as described in Maniatis et al., 1982, supra ) in N2Y + 10 mM MgS04 + 0.9% agarose 
after adsorption to DH1 E. coli cells (Hanahan, Mol. Biol . (1983) 166:557) for 20 min at 37 °C. Plates were 
incubated at 37* C for -13 hours, cooled at 4*C for 2.5 hours and the phage were lifted onto Gene Screen 
25 Plus (New England Nuclear) by laying precut filters over the plates for approximately 1 min and peeling 
them off. The adsorbed phage DNA was immobilized by floating the filter on 1.5 M NaCI, 0.5 M NaOH for 1 
min., neutralizing in 1.5 M NaCI, 0.5 M Tris-HCI, pH 8.0 for 2 min and 2XSSC for 3 min. Filters were air 
dried until just damp, prehybridized and hybridized at 42 • C as described for Southern analysis. Filters were 
probed for napin-containing clones using an Xho l/Sall fragment of the cDNA clone BE5 which was isolated 
30 from the B. campestris seed cDNA library described using the probe pN1 (Crouch et al., 1983, supra ). 
Three plaques were hybridized strongly on duplicate filters and were plaque purified as described (Maniatis 
et al., 1982, supra ). 

One of the clones named lambda CGN1-2 was restriction mapped and the napin gene was localized to 
overlapping 2.7 kb Xho l and 2.1 kb Sail restriction fragments. The two fragments were subcloned from 

35 lambda CGN1-2 DNA into pCGN789 (a pUC based vector the same as pUC119 with the normal polylinker 
replaced by the synthetic linker - 5' GGAATTCGTCGACAGATCTCTGCAGCTCGAGGGATCCAAGCTT 3' - 
(which represents the polylinker EcoRI, Sail, Bglll, Pstl, Xho l, Bam HI, Hindlll). The identity of the subclones 
as napin was confirmed by sequencing. The entire coding region sequence as well as extensive 5' 
upstream and 3' downstream sequences were determined (Figure 2). The lambda CGN1-2 napin gene is 

40 that encoding the mRNA corresponding to the BE5 cDNA as determined by the exact match of their 
nucleotide sequences. 

An expression cassette was constructed from the 5' -end and the 3' -end of the lambda CGN1-2 napin 
gene as follows in an analogous manner to the construction of pCGN944. The majority of the napin coding 
region of pCGN940 was deleted by digestion with Sail and religation to form pCGN1800. Single-stranded 

45 DNA from pCGN1800 was used in an in vitro mutagenesis reaction (Adelman et al., DNA (1983) 2:183-193) 
using the synthetic oligonucleotide ~~ ^ GCTTGTTCGCCATGGATATCTTCTGTATGTTC 3'. This 
oligonucleotide inserted an Eco RV and an Nco 1 restriction site at the junction of the promoter region and 
the ATG start codon of the napin gene. An appropriate mutant was identified by hybridization to the 
oligonucleotide used for the mutagenesis and sequence analysis and named pCGN1801. 

50 A 1.7 kb promoter fragment was subcloned from pCGN1801 by partial digestion with EcoRV and 
ligation to pCGN786 (a pCGN566 chloramphenicol based vector with the synthetic linker described above in 
place of the normal polylinker) cut with Eco RI and blunted by fill## resulting expression cassette, 
pCGN1803 contains 1.725 kb of napin promoter sequence, and 1.265 kb of napin 3' sequences with the 
unique cloning sites Sail, BgN, Pstl, and Xho l in between. Any sequence that requires seed-specific 

55 transcription or expression in Brassica, i.e., a fatty acid gene could be inserted in this cassette in a manner 
analogous to that described for spinach leaf ACP and the B. napus napin cassette in Example I. 
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Example III 

Other seed-specific promoters may be isolated from genes encoding proteins involved in seed 
triacylglycerol synthesis, such as acyl carrier protein from Brassica seeds. Immature seed were collected 

5 from Brassica campestris cv. "R-500," a self-compatible variety of turnip rape. Whole seeds were collected 
at stages corresponding approximately to 14 to 28 days after flowering. RNA isolation and preparation of a 
cDNA bank was as described above for the isolation of a spinach ACP cDNA clone except that the vector 
used was pCGN565. To probe the cDNA bank, the oligonucleotide (5')-ACTTTCTCAACTGTCTCTGGTT- 
TAGCAGC-(3') was synthesized using an Applied Biosystems DNA Synthesizer, model 380A, according to 

70 manufacturer's recommendations. This synthetic DNA molecule will hybridize at low stringencies to DNA or 
RNA sequences coding for the amino acid sequence (ala-ala-lys-pro-glu-thr-val-glu-Iys-val). This amino acid 
sequence has been reported for ACP isolated from seeds of Brassica napus (Slabas et al., 7th international 
Symposium of the Structure and Function of Plant Lipids, University of California, Davis, CA, 1986); ACP 
from B. campestris seed is highly homologous. Approximately 2200 different cDNA clones were analyzed 

75 using a colony hybridization technique (Taub and Thompson, Anal. Biochem . (1982) 126:222-230) and 
hybridization conditions corresponding to Wood et al., (Proc. Natl. Acad. Sci . (1985) 82:1585-1588). DNA 
sequence analysis of two cDNA clones showing obvious hybridization to the oligonucleotide probe indicated 
that one, designated pCGNIBcs, indeed coded for an ACP-precursor protein by the considerable homology 
of the encoded amino acid sequence with ACP proteins described from Brassica napus (Slabas et aj., 1980 

20 supra ). Similarly to Example II, the ACP cDNA clone can be used to isolate a genomic clone from which an 
expression cassette can be fashioned in a manner directly analogous to the B. campestris napin cassette. 

Other Examples 

25 Ninety-six clones from the 14-28 day post-anthesis B. campestris seed cDNA library (described in the 
previous example) were screened by dot blot hybridization of miniprep DNA on Gene Screen Plus nylon 
filters. Probes used were radioactively labeled first-strand synthesis cDNAs made from the day 14-28 post- 
anthesis mRNA or from B. campestris leaf mRNA. Clones which hybridized strongly to seed cDNA and little 
or not at all to leaf cDNA were catalogued. A number of clones were identified as representing the seed 

30 storage protein napin by cross-hybridization with an Xhol/Sall fragment of pNI (Crouch et al., 1983, supra ), a 
B. campestris genomic clone as a source of an embryo-specific promoter. 

Other seed-specific genes may also serve as useful sources of promoters. cDNa clones of cruciferin, 
the other major seed storage protein of B. napus , have been identified (Simon et al., 1985, supra ) and could 
be used to screen a genomic library for promoters. 

35 Without knowing their specific functions, yet other cDNA clones can be classified as to their level of 
expression in seed tissues, their timing of expression (i.e., when post-anthesis they are expressed) and their 
approximate representation (copy number) in the B. campestris genome. Clones fitting the criteria neces- 
sary for expressing genes related to fatty acid synthesis or other seed functions can be used to screen a 
genomic library for genomic clones which contain the 5' and 3' regulatory regions necessary for 

40 expression. The non-coding regulatory regions can be manipulated to make a tissue-specific expression 
cassette in the general manner described for the napin genes in previous examples. 

One example of a cDNA clone is EA9. It is highly expressed in seeds and not leaves from B. 
campestris . It represents a highly abundant mRNA as shown by cross-hybridization of seven other cDNAs 
from the library by dot blot hybridization. Northern blot analysis of mRNA isolated from day 14 seed, and 

45 day 21 and 28 post-anthesis embryos using a 700 bp EcoRI fragment of EA9 as a probe shows that EA9 is 
highly expressed at day 14 and expressed at a much lower level at day 21 and day 28. The restriction map 
of EA9 was determined and the clone sequenced. Identification of a polyadenylation signal and of polyA 
tails at the 3-end of EA9 confirms the orientation of the cDNA clone and the direction of transcription of the 
mRNA. The partial sequence provided here for clone EA9 (Figure 3) can be used to synthesize a probe 

so which will identify a unique class of Brassica seed-specific promoters. 

It is evident from the above results, that transcription or expression can be obtained specifically in 
seeds, so as to permit the modulation of phenotype or change in properties of a product of seed, 
particularly of the embryo. It is found that one can use transcriptional initiation regions associated with the 
transcription of sequences in seeds in conjunction with sequences other than the normal sequence to 

55 produce endogenous or exogenous proteins or modulate the transcription or expression of nucleic acid 
sequences. In this manner, seeds can be used to produce novel products, to provide for improved protein 
compositions, to modify the distribution of fatty acid, and the like. 
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All pubficatons and patent applications mentioned in this specification are indicative of the level of skill 
of those skilled in the art to which this invention pertains. All publications and patent applications are herein 
incorporated by reference to the same extent as if each individual publication or patent application was 
specifically and individually indicated to be incorporated by reference. 

5 

Claims 

Claims for the following Contracting States : BE, CH, DE, FR, GB, GR, IT, LI, LU, NL, SE 

1. A DNA construct comprising in the 5'-3' direction of transcription: 

w a seed specific transcriptional initiation region which is from other than the bean phaseolin 

promoter; 

a DNA sequence of interest other than the natural coding sequence joined to said initiation region; 

and 

a transcriptional termination region. 

15 

2. A DNA construct according to claim 1 , wherein said construct further comprises a translations initiation 
region immediately downstream of said transcription initiation region. 

3. A DNA construct according to any one of claims 1 or 2 wherein said DNA sequence of interest is an 
20 open reading frame encoding an amino acid sequence. 

4. A DNA construct according to any one of claims 1 or 2 wherein said DNA sequence of interest is 
complementary to a mRNA endogenous to a plant seed cell. 

25 5. A DNA construct according to any one of claims 1 to 4 wherein said natural coding sequence is 
expressed in a seed embryo cell. 

6. A DNA construct according to any one of claims 1 to 5 wherein said natural coding sequence is 
expressed at about 7 to about 40 days postanthesis. 

30 

7. A DNA construct according to claim 6 wherein said natural coding sequence is expressed at about 10 
to about 20 days postanthesis. 

8. A DNA construct according to any one of claims 1 to 7 wherein said initiation region is a storage 
35 protein gene initiation region or a fatty acid biosynthesis protein gene initiation region. 

9. A DNA construct according to claim 8 wherein said storage protein is napin. 

10. A DNA construct according to claim 8 wherein said fatty acid biosynthesis protein is acyl carrier 
40 protein. 

11. A DNA construct according to any one of claims 1 to 10 wherein said transcriptional initiation region is 
from a Brassica seed gene. 

45 12. A DNA construct according to any one of claims 1 to 4 wherein said transcriptional initiation region is 
obtainable from Fig. 1 or Fig. 2, and comprises a region of about 300 nucleotides 5* of the ATG 
translation start codon of a napin gene. 

13. A DNA construct according to any one of claims 1 to 12 wherein said DNA sequence of interest is 
so associated with fatty acid synthesis. 

14. A DNA construct according to any one of claims 1 to 13 wherein said transcriptional termination region 
is from a gene native to said transcriptional is from a gene native to said transcriptional initiation region. 

55 15. A DNA construct comprising in the 5'-3' direction of transcription, a seed-specific transcriptional 
initiation region wherein said transcriptional initiation region is free from the native DNA sequence under 
the regulatory control of said initiation region, a cloning site, and a transcriptional termination region. 
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16. A DNA construct according to claim 15 also comprising a translational initiation region positioned after 
said transcriptional initiation region in the 5'-3' direction. 

17. A DNA construct of claim 16 wherein a ONA sequence of interest other than the native sequence is 
5 inserted into said cloning site. 

18. A vector comprising a DNA construct according to any one of claims 1 to 17, a prokaryotic replication 
system and a marker for selection of transformed prokaryotes comprising said marker. 

io 19. A plant cell or seed comprising a DNA construct according to any one of claims 1 to 14. 

20. A plant comprising cells according to claim 19. 

21. A seed according to claim 19 wherein said seed is from a plant selected from the group consisting of 
75 Brassica, cotton, soybean, safflower and sunflower. 

22. A method for modifying the genotype of a plant to impart a desired characteristic to seed as distinct 
from other plant tissue said method comprising: 

transforming a host plant cell under genomic integration conditions with a DNA construct according 
20 to any one of claims 1 to 14 

growing said plant to produce seed. 

23. A method for specifically modifying the phenotype of seed distinct from other plant tissue, said method 
comprising: 

25 (i) transforming a host plant cell under genomic integration conditions with a DNA construct 

according to any one of claims 1 to 14; 

(ii) growing a plant under conditions to produce seed, said plant being comprised of cells capable of 
developing seed tissue and said cells having integrated in their genome, a said DNA construct. 

A method for modifying the genotype of a plant to impart a desired characteristic to seed as distinct 
from other plant tissue said method comprising; 

transforming a host plant cell under genomic integration conditions with a DNA construct compris- 
ing in the 5'-3' direction of transcription: 

a seed specific transcriptional initiation region; 

a DNA sequence of interest other than the natural coding sequence joined to said initiation region; 

and 

a transcriptional termination region. 

A method for specifically modifying the phenotype of seed distinct from other plant tissue, said method 
comprising: 

(i) transforming a host plant cell under genomic integration conditions with a DNA construct 
comprising in the 5'-Z* direction of transcription: 

a seed specific transcriptional initiation region; 

a DNA sequence of interest other than the natural coding sequence joined to said initiation 
region; and 

a transcriptional termination region; 

(ii) growing a plant under conditions to produce seed, said plant being comprised of cells capable of 
developing seed tissue and said cells having integrated in their genome a said DNA construct. 

50 Claims for the following Contracting States : AT, ES 

1. A method comprising the production of a DNA construct comprising in the 5'-3' direction of transcrip- 
tion: 

a seed specific transcriptional initiation region which is from other than the bean phaseolin 
55 promoter; 

a DNA sequence of interest other than the natural coding sequence joined to said initiation region; 

and 

a transcriptional termination region. 

16 



30 24. 



35 



25. 

40 
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2. A method according to claim 1 , wherein said construct further comprises a translational initiation region 
immediately downstream of said transcription initiation region. 

3. A method according to any one of claims 1 or 2 wherein said DNA sequence of interest is an open 
5 reading frame encoding an amino acid sequence. 

4. A method according to any one of claims 1 or 2 wherein said DNA sequence of interest is 
complementary to a mRNA endogenous to a plant seed cell. 

to 5. A method according to any one of claims 1 to 4 wherein said natural coding sequence is expressed in 
a seed embryo cell. 

6. A method according to any one of claims 1 to 5 wherein said natural coding sequence is expressed at 
about 7 to about 40 days postanthesis. 

75 

7. A method according to claim 6 wherein said natural coding sequence is expressed at about 10 to about 
30 days postanthesis. 

8. A method according to any one of claims 1 to 7 wherein said initiation region is a storage protein gene 
20 initiation region or a fatty acid biosynthesis protein gene initiation region. 

9. A method according to claim 8 wherein said storage protein is napin. 

10. A method according to claim 8 wherein said fatty acid biosynthesis protein is acyl carrier protein. 

25 

11. A method according to any one of claims 1 to 10 wherein said transcriptional initiation region is from a 
Brassica seed gene. 

12. A method according to any one of claims 1 to 11 wherein said transcriptional initiation region is 
30 obtainable from Fig. 1 or Fig. 2, and comprises a region of about 300 nucleotides 5* of the ATG 

translation start codon of a napin gene. 

13. A method according to any one of claims 1 to 12 wherein said DNA sequence of interest is associated 
with fatty acid synthesis. 

35 

14. A method according to any one of claims 1 to 13 wherein said transcriptional termination region is from 
a gene native to said transcriptional initiation region. 

15. A method comprising the production of a DNA construct comprising in the 5'-3' direction of transcrip- 
40 tion, a seed-specific transcriptional initiation region wherein said transcriptional initiation region is free 

from the native DNA sequence under the regulatory control of said initiation region, a cloning site, and 
a transcriptional termination region. 

16. A method according to claim 15 also comprising a translational initiation region positioned after said 
45 transcriptional initiation region in the 5' to 3* direction. 

17. A method according to claim 16 wherein a DNA sequence of interest other than the native sequence is 
inserted into said cloning site. 

so 18. A method comprising the production of a vector comprising a DNA construct obtainable according to 
any one of claims 1 to 17, a prokaryotic replication system and a marker for selection of transformed 
prokaryotes comprising said marker. 

19. A method comprising the production of a plant cell or seed involving transformation with a DNA 
55 construct obtainable according to any one of claims 1 to 14. 

20. A method according to claim 19 wherein said seed is from a plant selected from the group consisting 
of Brassica, cotton, soybean, safflower and sunflower. 



17 



EP 0 255 378 B1 



21. A method for modifying the genotype of a plant to impart a desired characteristic to seed as distinct 
from other plant tissue said method comprising: 

transforming a host plant cell under genomic integration conditions with a DNA construct according 
to any one of claims 1 to 14. 
5 growing said plant to produce seed. 

22. A method for specifically modifying the phenotype of seed distinct from other plant tissue, said method 
comprising: 

(i) transforming a host plant cell under genomic integration conditions with a DNA construct 
to according to any one of claims 1 to 14; 

(ii) growing a plant under conditions to produce seed, said plant being comprised of cells capable of 
developing seed tissue and said cells having integrated in their genome, a said DNA construct. 

23. A method for modifying the genotype of a plant to impart a desired characteristic to seed as distinct 
75 from other plant tissue said method comprising; 

transforming a host plant cell under genomic integration conditions with a DNA construct compris- 
ing in the 5'-3' direction of transcription: 

a seed specific transcriptional initiation region; 

a DNA sequence of interest other than the natural coding sequence joined to said initiation region; 

20 and 

a transcriptional termination region. 

24. A method for specifically modifying the phenotype of seed distinct from other plant tissue, said method 
comprising: 

25 (i) transforming a host plant cell under genomic integration conditions with a DNA construct 

comprising in the S'-Z' direction of transcription: 
a seed specific transcriptional initiation region; 

a DNA sequence of interest other than the natural coding sequence joined to said initiation 
region; and 

30 a transcriptional termination region; 

(ii) growing a plant under conditions to produce seed, said plant being comprised of cells capable of 
developing seed tissue and said cells having integrated in their genome a said DNA construct. 

PatentansprUche 

35 PatentansprUche fUr folgende Vertragsstaaten : BE, CH, DE, FR, GB, GR, IT, LI, LU, NL, SE 

1. DNA-Konstrukt, das in der S'-S'-Transkriptionsrichtung enthalt: 

eine samenspezifische Transkriptionsinitiationsregion, die nicht vom Bohnen-Phaseolinpromoter 
stammt; 

40 eine interessierende DNA-Sequenz, die nicht die mit der genannten Initiationsregion verbundene 
natUrliche Kodierungssequenz ist; und 
eine Transkriptionsterminationsregion. 

2. DNA-Konstrukt nach Anspruch 1, worin das genannte Konstrukt welters eine Translationsinitiationsre- 
45 gion unmittelbar stromabwarts der genannten Transkriptionsinitationsregion enthalt. 

3. DNA-Konstrukt nach einem der AnsprUche 1 Oder 2, worin die genannte interessierende DNA-Sequenz 
ein offener Leseraster ist, der fur eine Aminosauresequenz kodiert. 

50 4. DNA-Konstrukt nach einem der AnsprUche 1 Oder 2, worin die genannte interessierende DNA-Sequenz 
komplementar zu einer mRNA ist, die fUr eine Pflanzensamenzelle endogen ist. 

5. DNA-Konstrukt nach einem der AnsprUche 1 bis 4, worin die genannte natUrliche Kodierungssequenz in 
einer Samenembryozelle exprimiert ist. 

55 

6. DNA-Konstrukt nach einem der AnsprUche 1 bis 5, worin die genannte natUrliche Kodierungssequenz 
bei etwa 7 bis etwa 40 Tagen postanthesis exprimiert wird. 
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7. DNA-Konstrukt nach Anspruch 6, worin die genannte natOrliche Kodierungssequenz bei etwa 10 bis 20 
Tagen postanthesis exprimiert wird. 

8. DNA-Konstrukt nach einemder AnsprOche 1 bis 7,worin die genannte Initiationsregion eine Speicherpro- 
5 teingen-lnitiationsregion oder eine Fettsaurebiosyntheseproteingen-lnitiationsregion ist. 

9. DNA-Konstrukt nach Anspruch 8, worin das genannte Speicherprotein Napin ist. 

10. DNA-Konstrukt nach Anspruch 8, worin das genannte Fettsaurebiosyntheseprotein Acyltragerprotein ist. 

70 

11- DNA-Konstrukt nach einem der AnsprOche 1 bis 10, worin die genannte Transkriptionsinitiationsregion 
von einem Brassica-Samengen stammt 

12. DNA-Konstrukt nach einem der AnsprOche 1 bis 4, worin die genannte Transkriptionsinitiationsregion 
75 von Fig. 1 oder Fig. 2 erhSltlich ist und eine Region mit etwa 300 Nukleotiden 5' vom ATG- 

Translationsstartkodon eines Napingens umfaBt. 

13. DNA-Konstrukt nach einem der AnsprOche 1 bis 12, worin die genannte interessierende DNA-Sequenz 
mit Fettsauresynthese assoziiert ist. 

20 

14. DNA-Konstrukt nach einem der AnsprOche 1 bis 13, worin die genannte Transkriptionsterminationsre- 
gion von einem Gen stammt, das fOr die genannte Transkriptionsinitiationsregion nativ ist. 

15. DNA-Konstrukt, das in der S'-S'-Transkriptionsrichtung eine samenspezifische Transkriptionsinitiations- 
25 region, worin die genannte Transkriptionsinitiationsregion unter der regulierenden Steuerung durch die 

genannte Initiationsregion frei von der nativen DNA-Sequenz ist, eine Klonierungstelle und eine 
Transkriptionsterminationsregion enthalt. 

16. DNA-Konstrukt nach Anspruch 15, das auch eine Translationsinitiationsregion enthalt, die in der 5'-3'- 
30 Richtung nach der genannten Transkriptionsinitiationsregion angeordnet ist. 

17. DNA-Konstrukt nach Anspruch 16, worin eine interessierende DNA-Sequenz, die nicht die native 
Sequenz ist, in die genannte Klonierungsstelle eingefOgt ist. 

35 18. Vektor, der ein DNA-Konstrukt nach einem der AnsprOche 1 bis 17, ein prokaryotisches Replikationssy- 
stem und einen Marker zur Auswahl von transformierten Prokaryoten umfaGt, welche den genannten 
Marker enhalten. 

19. Pflanzenzelle oder -samen, die/der ein DNA-Konstrukt nach einem der AnsprOche 1 bis 14 enthalt. 

40 

20. Pflanze, die Zellen nach Anspruch 19 enthalt. 

21. Samen nach Anspruch 19, worin der genannte Samen von einer Pflanze stammt, die aus der Gruppe 
ausgewahlt ist, die aus Brassica, Baumwolle, Sojabohne, Saflor und Sonnenblume ausgewahlt ist. 

45 

22. Verfahren zum Modifizieren des Genotyps einer Pflanze, um Samen eine gewOnschte Eigenschaft zu 
verleihen, die ihn von anderem Pflanzengewebe unterscheidet, wobei das genannte Verfahren umfaBt: 
das Transformieren einer Wirtspflanzenzelle unter genomischen Integrationsbedingungen mit einem 
DNA-Konstrukt nach einem der AnsprOche 1 bis 14, 

50 das ZOchten der genannten Pflanze, um Samen zu erzeugen. 

23. Verfahen zum spezifischen Modifizieren des Phenotyps von Samen, der sich von anderem Pflanzenge- 
webe unterscheidet, wobei das genannte Verfahren umfaBt: 

(i) das Transformieren einer Wirtspflanzenzelle unter genomischen Integrationsbedingungen mit 
55 einem DNA-Konstrukt nach einem der AnsprOche 1 bis 14; 

(ii) das ZOchten einer Pflanze unter Bedingungen zu Erzeugung von Samen, wobei die genannte 
Pflanze aus Zellen besteht, die fahig sind, Samengewebe zu entwickeln, und die genannten Zellen in 
ihrem Genom ein genanntes DNA-Konstrukt integriert haben. 
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24. Verfahren zum Modifizieren des Genotyps einer Pflanzen, urn Samen eine gewOnschte Eigenschaft zu 
verleihen, die ihn von anderem Pflanzengewebe unterscheidet, wobei das genannte Verfahren umfaflt: 
das Transformieren einer Wirtspflanzenzelle unter genomischen Integrationsbedingungen mit einem 
DNA-Konstrukt, das in der S'-S'-Transkriptionsrichtung umfaBt: 
5 eine samenspezifische Transkriptionsinitiationsregion; 

eine interessierende DNA-Sequenz, die nicht die mit der genannten Initiationsregion verbundene 
natUrliche Kodierungssequenz ist; und 
eine Transkriptionsterminationsregion. 



70 25. Verfahren zum spezifischen Modifizieren des Phenotyps von Samen, der sich von anderem Pflanzenge- 
webe unterscheidet, wobei das genannte Verfahren umfaBt: 

(i) das Transformieren einer Wirtspflanzenzelle unter genomischen Integrationsbedingungen mit 
einem DNA-Konstrukt, das in der S'-S'-Transkriptionsrichtung umfaBt: 

eine samenspezifische Transkriptionsinitiationsregion; 
75 eine interessierende DNA-Sequenz, die nicht die mit der genannten Initiationsregion verbundene 

natUrliche Kodierungssequenz ist; 
eine Transkriptionsterminationsregion; 

(ii) das ZUchten einer Pflanze unter Bedingungen zur Erzeugung von Samen, wobei die genannte 
Pflanze aus Zellen besteht, die fahig sind, Samengewebe zu entwickeln, und die genannten Zellen in 

20 ihrem Genom ein genanntes DNA-Konstrukt integriert aufweisen. 

Patentanspriiche flir folgende Vertragsstaaten : AT, ES 

1. Verfahren, das die Herstellung eines DNA-Konstrukts umfaBt, das in der S'-S'-Transkriptionsrichtung 
25 enthalt: 

eine samenspezifische Transkriptionsinitiationsregion, die nicht vom Bohnen-Phaseolinpromoter 
stammt; 

eine interessierende DNA-Sequenz, die nicht die mit der genannten Initiationsregion verbundene 
natUrliche Kodierungssequenz ist; und 
30 eine Transkriptionsterminationsregion. 

2. Verfahren nach Anspruch 1, worin das genannte Konstrukt weiters eine Translationsinitiationsregion 
unmittelbar stromabwarts der genannten Transkriptionsinitationsregion umfaBt. 

35 3. Verfahren nach einem der AnsprUche 1 Oder 2, worin die genannte interessierende DNA-Sequenz ein 
offener Leseraster ist, der fOr eine Aminosauresequenz kodiert. 

4. Verfahren nach einem der AnsprUche 1 Oder 2, worin die genannte interessierende DNA-Sequenz 
komplementSr zu einer mRNA ist, die fUr eine Pflanzensamenzelle endogen ist. 

40 

5. Verfahren nach einem der AnsprUche 1 bis 4, worin die genannte natUrliche Kodierungssequenz in 
einer Samenembryozelle exprimiert wird. 

6. Verfahren nach einem der AnsprUche 1 bis 5, worin die genannte natUrliche Kodierungssequenz bei 
45 etwa 7 bis etwa 40 Tagen postanthesis exprimiert wird. 

7. Verfahren nach Anspruch 6, worin die genannte natUrliche Kodierungssequenz bei etwa 10 bis 20 
Tagen postanthesis exprimiert wird. 

so 8. Verfahren nach einem der AnsprUche 1 bis 7, worin die genannte Initiationsregion eine Speicherprotein- 
gen-lnitiationsregion Oder eine Fettsaurebiosyntheseproteingen-lnitiationsregion ist 

9. Verfahren nach Anspruch 8, worin das genannte Speicherprotein Napin ist. 

55 10. Verfahren nach Anspruch 8, worin das genannte Fettsaurebiosyntheseprotein Acyltragerprotein ist. 

11. Verfahren nach einem der AnsprUche 1 bis 10. worin die genannte Transkriptionsinitiationsregion von 
einem Brassica-Samengen stammt. 
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12. Verfahren nach einem der AnsprUche 1 bis 11, worin die genannte Transkriptionsinitiationsregion von 
Fig. 1 Oder Fig. 2 erhSltlich ist und eine Region mit etwa 300 Nukleotiden 5' vom ATG-Translations- 
startkodon eines Napingens umfaBt. 

5 13. Verfahren nach einem der AnsprUche 1 bis 12, worin die genannte interessierende DNA-Sequenz mit 
Fettsauresynthese assoziiert ist. 

14. Verfahren nach einem der AnsprUche 1 bis 13. worin die genannte Transkriptionsterminationsregion von 
einem Gen stammt, das fUr die genannte Transkriptionsinitiationsregion nativ ist. 

70 

15. Verfahren, das die Hersteilung eines DNA-Konstrukts umfaBt, das in der S'-S'-Transkriptionsrichtung 
eine samenspezifische Transkriptionsinitiationsregion, worin die genannte Transkriptionsinitiationsregion 
unter der regulierenden Steuerung durch die genannte Initiationsregion frei von der nativen DNA- 
Sequenz ist eine Klonierungstelle und eine Transkriptionsterminationsregion e nth Sit. 

75 

16. Verfahren nach Anspruch 15, das auch eine Translationsinitiationsregion enthalt, die in der 5'-3'- 
Richtung nach der genannten Transkriptionsinitiationsregion angeordnet ist. 

17. Verfahren nach Anspruch 16, worin eine interessierende DNA-Sequenz, die nicht die native Sequenz 
20 ist, in die genannte Klonierungsstelle eingefugt wird. 

18. Verfahren, das die Hersteilung eines Vektors umfaBt, der ein DNA-Konstrukt, das nach einem der 
AnsprUche 1 bis 17 erhaltlich ist, ein prokaryotisches Replikationssystem und einen Marker zur 
Auswahl von transformierten Prokaryoten umfaBt, welche den genannten Marker enthalten. 

25 

19. Verfahren, das die Hersteilung einer/eines Pflanzenzelle oder -samen umfaBt, an der die Transformation 
mit einem DNA-Konstrukt beteiligt ist, das nach einem der AnsprUche 1 bis 14 erhaltlich ist. 

20. Verfahren nach Anspruch 19, worin der genannte Samen von einer Pflanze stammt, die aus der Gruppe 
30 ausgewahlt ist, die aus Brassica, Baumwolle, Sojabohne, Saflor und Sonnenblume ausgewahlt ist. 

21. Verfahren zum Modifizieren des Genotyps einer Pflanze, urn Samen eine gewUnschte Eigenschaft zu 
verleihen, die ihn von anderem Pflanzengewebe unterscheidet, wobei das genannte Verfahren umfaBt: 
das Transform ieren einer Wirtspflanzenzelle unter genomischen Integrationsbedingungen mit einem 

35 DNA-Konstrukt nach einem der AnsprUche 1 bis 14, 

das ZUchten der genannten Pflanze, um Samen zu erzeugen. 

22. Verfahen zum spezifischen Modifizieren des Phenotyps von Samen, der sich von anderem Pflanzenge- 
webe unterscheidet, wobei das genannte Verfahren umfaBt: 

40 (i) das Transformieren einer Wirtspflanzenzelle unter genomischen Integrationsbedingungen mit 

einem DNA-Konstrukt nach einem der AnsprUche 1 bis 14; 

(ii) das ZUchten einer Pflanze unter Bedingungen zu Erzeugung von Samen, wobei die genannte 
Pflanze aus Zellen besteht, die fahig sind, Samengewebe zu entwickeln, und die genannten Zellen in 
ihrem Genom ein genanntes DNA-Konstrukt integriert haben. 

45 

23. Verfahren zum Modifizieren des Genotyps einer Pflanzen, um Samen eine gewUnschte Eigenschaft zu 
verleihen, die ihn von anderem Pflanzengewebe unterscheidet. wobei das genannte Verfahren umfaBt: 
das Transformieren einer Wirtspflanzenzelle unter genomischen Integrationsbedingungen mit einem 
DNA-Konstrukt, das in der 5'-3 , -Transkriptionsrichtung umfaBt: 

so eine samenspezifische Transkriptionsinitiationsregion; 

eine interessierende DNA-Sequenz, die nicht die mit der genannten Initiationsregion verbundene 
natUrliche Kodierungssequenz ist; und 
eine Transkriptionsterminationsregion. 

55 24. Verfahren zum spezifischen Modifizieren des Phenotyps von Samen, der sich von anderem Pflanzenge- 
webe unterscheidet, wobei das genannte Verfahren umfaBt: 

(i) das Transformieen einer Wirtspflanzenzelle unter genomischen Integrationsbedingungen mit 
einem DNA-Konstrukt, das in der 5'-3'-Transkriptionsrichtung umfaBt: 
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eine samenspezifische Transkriptionsinitiationsregion; 

eine interessierende DNA-Sequenz, die nicht die mit der genannten Initiationsregion verbundene 
natUrliche Kodierungssequenz ist; und 
eine Transkriptionsterminationsregion; 
5 (ii) das ZUchten einer Pflanze unter Bedingungen zur Erzeugung von Samen, wobei die genannte 

Pflanze aus Zelten besteht, die fahig sind, Samengewebe zu entwickeln, und die genannten Zellen in 
ihrem Genom ein genanntes DNA-Konstrukt integriert aufweisen. 

Revendlcatlons 

70 Revendlcatlons pour les Etats contractants suivants : BE, CH, DE, FR, GB, GR, IT, LI, LU, NL, SE 

1. Construction d'ADN comprenant, dans la direction 5'-3' de transcription : 

une region d'initiation de transcription spdcifique d'une graine qui provient d'autre chose que le 
promoteur de phas£oline du haricot ; 
75 une sequence d' ADN d'int^ret autre que la sequence codante naturelle jointe h ladite region 

d'initiation ; et 

une region de terminaison de transcription. 

2. Construction d'ADN selon la revendication 1, ou ladite construction comprend de plus une region 
20 d'initiation de traduction immSdiatement en aval de ladite region d'initiation de transcription. 

3. Construction d'ADN selon I'une quelconque des revendications 1 ou 2, ou ladite sequence d'ADN 
d'interet est un cadre de lecture ouvert codant une sequence d'acides amines. 

25 4. Construction d'ADN selon I'une quelconque des revendications 1 ou 2, ou ladite sequence d'ADN 
d'int^ret est compl6mentaire d'un ARNm endog&ne h une cellule de graine de plante. 

5. Construction d'ADN selon I'une quelconque des revendications 1 k 4, oti ladite sequence codante 
naturelle s'exprime dans une cellule d'embryon de graine. 

30 

6. Construction d'ADN selon Tune quelconque des revendications 1 & 5, ou ladite sequence codante 
naturelle s'exprime aux environs de 7 h environ 40 jours apr£s I'anthfese. 

7. Construction d'ADN selon la revendication 6, ou ladite sequence codante naturelle s'exprime aux 
35 environs de 10 & environ 20 jours apr£s I'anth&se. 

8. Construction d'ADN selon I'une quelconque des revendications 1 h 7, oCi ladite region d'initiation est 
une region d'initiation d'un g&ne d'une proteine de stockage ou une region d'initiation d'un gene d'une 
proline de biosynthfese d'un acide gras. 

40 

9. Construction d'ADN selon la revendication 8, ou ladite proline de stockage est la napine. 

10. Construction d'ADN selon la revendication 8, ou ladite proline de biosynthfese d'un acide gras est une 
proline porteuse d'acyle. 

45 

11. Construction d'ADN selon I'une quelconque des revendications 1 £ 10, ou ladite region d'initiation de 
transcription provient du gfene de la graine de Brassica. 

12. Construction d'ADN selon I'une quelconque des revendications 1 k 4, oO ladite region d'initiation de 
so transcription est obtenue de la figure 1 ou de la figure 2 et comprend une region d'environ 300 

nucleotides h 5' du codon de d6but de traduction de ATG d'un g&ne de napine. 

13. Construction d'ADN selon I'une quelconque des revendications 1 & 12, ou ladite sequence d'ADN 
d'int^ret est assoctee h la synthase d'un acide gras. 

55 

14. Construction d'ADN selon I'une quelconque des revendications 1 ^ 13, ou ladite region de terminaison 
de transcription provient d'un g&ne natif de ladite region d'initiation de transcription. 
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15. Construction d'ADN comprenant, dans la direction 5'-3' de la transcription, une region d'initiation de 
transcription specifique de la graine ou ladite region d'initiation de transcription est libre de la sequence 
d'ADN native sous le controle r^gulateur de ladite region d'initiation, un site de clonage et une region 
de terminaison de transcription. 

5 

16. Construction d'ADN selon la revendication 15, comprenant egalement une region d'initiation de 
traduction plac£e apr£s ladite region d'initiation de transcription dans la direction 5'-3 f . 

17. Construction d'ADN selon la revendication 16, ou une sequence d'ADN d'int4r§t, autre que la sequence 
ro native, est ins£r£e dans ledit site de clonage. 

18. Vecteur comprenant une construction d'ADN selon Tune quelconque des revendications 1 & 17, un 
systfeme de replication procaryote et un marqueur pour la selection de procaryotes transformds 
comprenant ledit marqueur. 

15 

19. Cellule de plante ou graine comprenant une construction d'ADN selon Tune quelconque des revendica- 
tions 1 & 14. 

20. Plante comprenant des cellules selon la revendication 19. 

20 

21. Graine selon la revendication 19, ou ladite graine provient d'une plante choisie dans le groupe 
consistant en Brassica, coton, soja, carthame et tournesol. 

22. Methode de modification du genotype d'une plante pour impartir une caract£ristique souhait£e & une 
25 graine pour la distinguer d'un autre tissu de plante, ladite methode comprenant : 

la transformation d'une cellule d'une plante hote en conditions d'integration genomique avec une 
construction d'ADN selon Tune quelconque des revendications 1 & 14, 
la croissance de ladite plante pour produire la graine. 

30 23. Methode pour la modification specifique du phenotype d'une graine distincte d'un autre tissu de plante, 
ladite methode comprenant : 

(i) la transformation d'une cellule d'une plante hote en conditions d'integration genomique avec une 
construction d'ADN selon Tune quelconque des revendications 1 h 14 ; 

(ii) la croissance d'une plante dans des conditions de production de la graine, ladite plante se 
35 composant de cellules capables de developper le tissu de la graine et lesdites cellules ayant int§gr€ 

dans leur genome, ladite construction d'ADN. 

24. Methods de modification du genotype d'une plante pour impartir une caract£ristique souhaitee & la 
graine, la distinguant d'un autre tissu de la plante, ladite methode comprenant : 

40 la transformation d'une cellule d'une plante hote en conditions d'integration genomique avec une 

construction d'ADN comprenant, dans la direction 5'-3' de transcription : 
une region d'initiation de transcription specifique de la graine ; 

une sequence d'ADN d'int£ret autre que la sequence codante naturelle jointe h ladite region 
d'initiation ; et 
45 une region de terminaison de transcription. 

25. Methode pour modifier specifiquement le phenotype d'une graine la distinguant d'un autre tissu de 
plante, ladite methode comprenant : 

(i) la transformation d'une cellule d'une plante hote en conditions d'integration genomique avec une 
so construction d'ADN comprenant, dans la direction 5'-3 f de la transcription : 

une region d'initiation de transcription specifique de la graine ; 

une sequence d'ADN d'interet autre que la sequence codante naturelle jointe a ladite region 
d'initiation ; et 

une region de terminaison de transcription ; 
55 (ii) la croissance d'une plante en conditions pour produire la graine, ladite plante se composant de 

cellules capables de developper le tissu de la graine et lesdites cellules ayant integre dans leur 
genome une construction d'ADN. 
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Revendlcations pour les Etats contractants sulvants : AT, ES 

1. Mgthode comprenant la production d'une construction d'ADN comprenant, dans la direction 5'-3' de 
transcription : 

5 une region d'initiation de transcription sp£cifique d'une graine qui provient d'autre chose qu I 
promoteur de phas£oline du haricot ; 

une sequence d'ADN d'intSret autre que la sequence codante naturelte jointe k ladite region d'initiation 
et 

70 une region de terminaison de transcription. 

2. Methode selon la revendication 1, oCi ladite construction comprend de plus une region d'initiation de 
traduction imm^diatement en aval de ladite region d'initiation de transcription. 

75 3. M&hode selon Tune quelconque des revendications 1 ou 2, ou ladite sequence d'ADN d'int€r§t est un 
cadre de lecture ouvert codant une sequence d'acides amines. 

4. M&hode selon I'une quelconque des revendications 1 ou 2, ou ladite sequence d'ADN d'interet est 
complgmentaire d'un ARNm endogfene k une cellule de graine de plante. 

20 

5. Methode selon I'une quelconque des revendications 1 k 4, ou ladite sequence codante naturelle 
s'exprime dans une cellule d'embryon de graine. 

6. Mgthode selon Tune quelconque des revendications 1 k 5, ou ladite sequence codante naturelle 
25 s'exprime aux environs de 7 it environ 40 jours apr&s l'anth£se. 

7. Methode selon la revendication 6, oO ladite sequence codante naturelle s'exprime aux environs de 10 k 
environ 20 jours aprfes I'anth&se. 

30 8. Methode selon I'une quelconque des revendications 1 k 7, ou ladite region d'initiation est une region 
d'initiation d'un gfene d'une proline de stockage ou une region d'initiation d'un gfene d'une proline de 
biosynthfese d'un acide gras. 

9. Methode selon la revendication 8, ou ladite proline de stockage est la napine. 

35 

10. Methode selon la revendication 8. ou ladite proline de biosynth&se d'un acide gras est une proteine 
porteuse d'acyle. 

11. Methode selon I'une quelconque des revendications 1 k 10, oCi ladite region d'initiation de transcription 
40 provient du gSne de la graine de Brassica. 

12. Methode selon I'une quelconque des revendications 1 & 11, ou ladite region d'initiation de transcription 
peut etre obtenue de la figure 1 ou de la figure 2 et comprend une region d'environ 300 nucleotides k 
5* du codon de debut de traduction de ATG d'un gfcne de la napine. 

45 

13. Methode selon Tune quelconque des revendications 1 k 12, ou ladite sequence d'ADN d'interet est 
associee k la synthase d'un acide gras. 

14. Methode selon I'une quelconque des revendications 1 & 13, ou ladite region de terminaison de 
50 transcription provient d'un gfene natif de ladite region d'initiation de transcription. 

15. Methode comprenant la production d'une construction d'ADN comprenant, dans la direction 5'-3' de 
transcription, une region d'initiation de transcription sp£cifique de la graine ou ladite region d'initiation 
de transcription est exempte de la sequence d'ADN native sous le contrdle r^gulateur de ladite region 

55 d'initiation, un site de clonage et une region de terminaison de transcription. 

16. Methode selon la revendication 15, comprenant ggalement une region d'initiation de traduction placde 
aprfes ladite region d'initiation de transcription dans la direction 5' k 3'. 
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17. Methods selon la revendication 16, oCi une sequence d'ADN d'int6ret, autre que la sequence native, est 
ins£r£e dans (edit site de clonage. 

18. Methode comprenant la production d'un vecteur comprenant une construction d'ADN que I'on peut 
obtenir s Ion Tune quelconque des revendications 1 & 17, un systfeme de replication procaryote et un 
marqueur pour une selection de procaryotes transform^ comprenant (edit marqueur. 

19. Mgthode comprenant la production d'une cellule de plante impliquant la transformation avec une 
construction d'ADN que Ton peut obtenir selon Tune quelconque des revendications 1 & 14. 

20. M6thode selon la revendication 19, ou ladite graine provient d'une plante choisie dans le groupe 
consistant en Brassica, coton, soja, carthame et tournesol. 

21. Methode de modification du genotype d'une plante pour im parti r une caract£ristique souhait£e & une 
graine, la distinguant d'un autre tissu de plante, ladite methode comprenant : 

la transformation d'une cellule d'une plante hote en conditions d'intSgration gSnomique avec une 
construction d'ADN selon Tune quelconque des revendications 1 & 14, la croissance de ladite plante 
pour produire la graine. 

22. M&hode pour la modification sp^cifique du ph^notype d'une graine distincte d'un autre tissu de plante, 
ladite m&hode comprenant : 

(i) la transformation d'une cellule d'une plante hote en conditions d'int£gration ggnomique avec une 
construction d'ADN selon Tune quelconque des revendications 1 & 14 ; 

(ii) la croissance d'une plante dans des conditions de production de la graine, ladite plante se 
composant de cellules capables de d£velopper le tissu de la graine et lesdites cellules ayant integre 
dans leur genome ladite construction d'ADN. 

23. M&hode de modification du genotype d'une plante pour impartir une caract6ristique souhait§e & la 
graine, la distinguant d'un autre tissu de la plante, ladite methode comprenant : 

la transformation d'une cellule d'une plante hote en conditions d'int£gration ggnomique avec une 
construction d'ADN comprenant, dans la direction 5'-3' de transcription : 
une region d'initiation de transcription spgcifique de la graine ; 

une sequence d'ADN d'int^ret autre que la sequence codante naturelle jointe h ladite region d'initiation 
;et 

une region de terminaison de transcription. 

24. Methode pour modifier sp^cifiquement le phenotype d'une graine, la distinguant d'un autre tissu de 
plante, ladite methode comprenant : 

(i) la transformation d'une cellule d'une plante hote en conditions d'int^gration g^nomique avec une 
construction d'ADN comprenant, dans la direction 5'-3' de la transcription : 

une region d'initiation de transcription sp^cifique de la graine ; 

une sequence d'ADN d'int^ret, autre que la sequence codante naturelle, jointe k ladite region 
d'initiation ; et 

une region de terminaison de transcription ; 

(ii) la croissance d'une plante en conditions pour produire la graine, ladite plante se composant de 
cellules capables de d£velopper le tissu de la graine et lesdites cellules ayant int£gr£ dans leur 
genome une construction d'ADN. 
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NAP -2 7 C Liattr LENGTH - 702 

SfaNX 

rolcl MA6III MbOlX 

tir i 

1 AAAGCGATCTGTCTTGGTAT5TATCTACaAGTAXCAAAAGAGAAC#ATCCAAITCAG7AGTAGAA3GAT*rTG 
19 31 53 

36 

Alul Ah*XII FokX EcoftV KgaX 

II III 
72 AGAGCTTTT 4 5 AAGCC3 4 TCA5GTGTGTGCTTT 4ATCTTATT6ATATCATCCAT? 4 GCGTTGTTTAATGC Z \ 
76 62 106 117 130 

Ddel 

143 TCTTTAGATATGTTTCTGTTTCTTTCTCAGTGTCTGAATATCTGATMCTGCAATGTGAGAAAGCCACACC 2 

169 

TaqT 

Sapl flinfl 

214 AAACCAAAAXATTCAAATCTTATATTTTTAATAATGTCGAATCACTCGGAGTTGCCACCTTCrGTGCCAAT 2 
224 253 

251 

BinfX MboII EcoRI 

285 TGTGCTGAATCTATCACACTAAAAAAAACATTTCTTCAAGGTAATGACTTGTGGACTATCGTTCTGAATTC 3 
292 310 351 

356 TCATTAAGTTTTTATTTTTTGAAGTTTAAGTXTrTACCTICTTTTXTGAAAAATAICGTTCATAAGATGTC 4 

424 

SphX 

SczFI Nlalll Nip (7524 ) X 

tcoRll AluX SfaNX Nlalll MMIXX 

M II I I I 

427 ACGCCAGGACATGAGCTACACATCACATATTAGCATGCAGATGCGGACSATTTGTCACTCACTtCAAACAC 4 
430 442 457 464 480 

432 440 464 

464 
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Sap: PtnaCI 
SphX Mbol 
Tthlilll ,fd t: N»p(7524)I MmXX 

*j ul AvaXIX Hiaiiz Afllll upnl Ni*i:i 

498 CTAAAA6AKTTCTCTCTCACAWACACACACMW S 
S 2' 532 339 548 S59 563 

908 531 539 550 

539 553 
543 551 

HphI Mnll 

I I 

5S9 TCaTTCTCXCCTAIAAATTACACCCTCSCCTTCACtTTTtACTCAAACCAXAACTCATCACTXCAAAACA 6 

5o9 583 

MboXI Alul Mail 

640 TACACAAJ,2£GCGAACAASCTCTTCCTCC^CTCGGCAACTCTCGCCTTCTTCTTCCTTCTCAC 702 
653 659 675 

SITE (NAP-270J: Q 
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Ltmbdt CQS1-2 

NCG-186 Linear LENGTH - 4325 

XhoT 

Taql flindlll 
Aval Alul Taql 

II III 
1 CTCGXGGCXCTC ACTAACATGAACTTTGXCGWCACCCCAACTXTCC5AACCTTATTTCTCTTTTCCAT 6 9 

2 52 66 

3 50 

2 

Sad 

fihax xbal Alul 

II II 
7 0 ACTCTAATTGAGCCGTGCGCTCTATCTAGACCAATTAGAATTGATGGAGCTCTAAAGGTTGCTGGCTGT 138 

89 95 119 

121 

NdaX Kdel 
I I 
139 TTTCTTGTTCATATGATT AACTTCTAAACTTGTGTATAAATATTCTCTGAAAGTGCTTCTTTTGGCATA 207 

ISO 206 
208 TGTAGGTTGGGCAAAAACGAGGAAGATTGCTTCTCAATTTGGAAGAGGATGAACAGCC3AAGAAGAAAA 276 



Sau3AX 
Ddal 

277 TAAGAATACGCAGTCCTGCTACTCAATGGATCTCAGTCTATAACGGTCGTCGTCCCATGAAACAGAGGT 345 

309 

305 

EcoRV 

346 AAAACATTTTTTGCATATACACTTTGAAAGTTCCTCACTAACTGTGTAATCTTTTGGTAGATATCACTA 414 

408 
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fiinex: 

fiMXII 
Dd«I 
BftEXX 

Ball HatXIX Alul 

415 CAA7G7CGGAGAGACAA3GGC7GMaTCANCATA7ACAA^ 4 83 

439 469 481 

438 

439 

439 
440 
438 

Alul BlnfX 
484 ?$7AGCATCAGCAGC7AA7C7CTGGGCTCTCA7CA?GC^TGCT6GAACTG<^TTCACTTCTCAAG?t?A 552 
498 535 

MipX 

HpaXI BinfX 
! I 
553 TGAGTTG7CACCGG7C77CC7ACACAAGG7AA7AA7CAG77GAAGCAA77AAGAA7CAATT7GAT77G7 621 

564 606 
564 

Ddel 

i 

622 AG7AAAC7AAGAAGAAC7TACCT7A7STTT7CCCCGCAGGAC7GGATTATGGAACAA7GGGAAAAGAAC 690 
629 

flacX 

Alul Alul Alul 

II II 
691 TACTA7A7AAGC7CCATAGC7GG7TCAGATAACGGGAGC7C7T7AG77GTTA7GTCAAAAGG7TAG7GT 759 

702 710 729 

731 

760 TTAG7GAA7AATAAAC77ATACCACAAAG7C7TCATTGACT7A777A7AXACTTGTTG7GAA77GC7AG 828 



DdtX Hlnfl 
829 GAAC7ACTTATTCTCAGCAG7CA7ACAAAGTGAG7GAC7CA7T7CCG7TCAAGTGGA7 AAA7AASAAA7 897 
842 865 
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XmnI TaqI 

I I 
898 GGAAAG AACATTXTCATGTAACCTCCATCACAACTOCTGCTMTCSXTCGGGTQTGCr AATGTCGACGA 966 

908 961 

8au3AI 
Bell 

967 ACTCTGGCTTCTCTGATCWGTAGGTTTTTGTCTCTTATIGtCTGGTGTTWTATrrTCCCCTGATAST 1035 

981 
981 

Alul IUaI 

1036 CTAATATGATAAACTCTGCGTTGTGAAAGGTGGXGGAGCTTGACTTTTTGTACCCAAGCGATGGGATAC 1104 

1074 * 1087 

$«u3AX Alul 
I I 

1105 ATAGGAGG7GGGAGAATG5GTATAGAATAACA7CAA7GGCAGCAACTGCGGATCAAGCAGC77TCATAT 1173 

1155 1165 

Seal 

flinfl R«aI 

I I 
1174 TAAGCATACCAAAGCGTAAGATGGTGGATGAAACTCAAGAGACTCTCCGCACCACCGCCTTTCCAAGTA 1242 

1215 12« 

1242 

Alul Sav3AX Ddel 

I I ■ 

1243 CTCATGTCAAGGTXGGTTTCTTTAGCTTTGAACACAGATTTGGATCTTTTTGTTTTGTTTCCATATACT 1311 

1268 1285 13U 

DdtX 

AvaXX AluX Hinfl *J*I 

III II 
1312 TAGGACCTGAGAGCTXTTGGT7GAITITTTTTTCAGGACAAATGGGCGAAGAATCTGTACAXTGCATCA 1380 

1315 1325 1363 1370 

1319 

1381 ATATGCTATGGCAGGACAGTGTGCTGATACACACTXAAGCATCATGTGGAAAGCCAAAGACAATTGGAG 1449 
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HinfX 
Ddei 

1450 CGAWCTCAGCGTCGTCATAATACCAATCAAACAC6TAAAACCACXC5CAXCCTCTTTCGTTGAATGTA 1518 

1456 
1454 

Rl&X 

1519 ATGAAAGGSATGTGTCTTGGTATGTATCTACGAATAACWAACASAAGATaGAATTAGTAGTAGAAATA 1567 

1548 

Alul EcoRV 
1588 TTTGGGAGCTTTTTAAGCCCTTCAA^TGTOCTTTTTATCTTATTGATATCATCCATTTGCGTTGTTXAA 1656 
1596 1635 

Xbtl Vdml 

1657 TGCGTCTCTAGATATGTTCCTATATCTTTCTCAGTGTCTGATAAGTGAAATGTGAGAAAACCATACCAA 1725 
1664 1687 

Hlafl 

1726 ACCAAAATATTCAAATCTTAtTTTTAATAATGTTGAATCACTCGGAGTTGCCACCTTCTGTGCCAATTG 1794 

1761 

HirrfX EcoRI 
j I 
1795 TGCTGAATCTATCACACTAGAAAAAAACATTTCTTCAAGGTAAT5ACTTGTGGACTATGTTCTGAA7TC 1863 

1800 1859 
1864 TCATTAAGTTTTTATTTTCTGAAGTTTAAGTTTTTACCTTCTGTTTTGAAATATATCGTTCATAAGATG 1932 



SphI 

BitNl Alul SAU3AI 
1933 TCACGCCAGGACATGAGCTACACATCGCACATAGCATGCAGATCAGGACGATTTGTCACTCACTTCAAA 20C1 

1940 1950 1973 

1971 
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SphI 

DdaX AlttI Hhal NdaX Nail Sau3AX 

2002 ^CCTAAGAGCTTCTCTCTCACAGCGCACACACATATGCATGCAATATTTAC^ 2070 

2006 2012 2C28 2036 2042 2058 

2044 

2071 ATCTCCATTCTCACCTATAAATTAGXGCCTC5CCTCCACTCTTTACTCAAACCAAAACTCATCACTACA 2139 



Alul 

2140 CAACATACACAAATGGCGAACAACCTCTTCCTCGTCTCGGCAACTCTCCCCTCGTTCTTCCTTCTCACC 2208 

MCTAlaAiatyiLtuPhtLtuVAlStrAltThrLfuAlaLtuPhtPhtLiuIouThr 
2164 

Nael 

Ball Mipl 
% „ Hindi Hpall 

AccX ^ ^accI KaaXIX 

2209 MTGCCTCCGTCrACAGGACGGTTG^ 2277 

AinAlaS#rValTyrArgThrVAlV*lGluValAipGluAtpA«pAla7hrAinProAlaGlyProPh« 
2220 2241 $271 

2239 2268 
2240 2268 
2241 2269 

„. Hindlll 
Binfl 

I I | 

2278 ^^J^£^^^J < 3 T WGAAGGAGTTTCAGCAAGCACAACACCTGAAAGCTTGCCAACAATGGCTCCAC 2346 
228i r ° Ly,CyaXrglyiC lu5h *^ 1 n51nAlaGlnHiatauLyfAlaCy$GlnGlnTrpLtuHif 

2325 

Mapl Avail 
fipall Avail AluX TaqX 

2347 ^SCAGGCAATGCAGTCCGGTAGTGGTCCAAGCTGGACCCTCGATGGTGAGTTTGATTTTGAAGACGM 2413 
lytGlnAiafffiTGlnSarGlySarGlvProS 

2364 2362 
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Ha«III SacI 

Apt! aatZII AIuX 

III II 
2416 CTGGAGAACCAACAACAGGGCCCGCAGCAflXSGCCACCGCTGCTCCAGCACfTGCTGCAACGAGCTCCAC 2484 
ValGluA$nGlnGlAGlnGIyProGlnGlnArgPro?roLtuL«uGlnGlnCy8CyiA0AGluL#uKis 

2438 2449 2479 

243$ 2481 

TaqI 

BatNX 

I I I 

2485 CAGGAAGAGCCACTTTGCGTTTGCCCAACCTTGAAAGGAGCATCCAAAGCCGTTAAACAACAGATTCGA 2553 
GlnGluGlu?roUuCysValCyi?roThrltuLy«GlyAlAS«rLytAlavalLysGlnGlnZl«Arg 

2486 255i 

2554 CAACAACAGGGACAACAAATGCAGGGACAGCAGATGCAGCAAG7GATTAGCCGTATCTACCAGACCGCT 2622 
GlAGlAGlnGlyGlnGlnMETGlnGlyGlnGlnMZ7GlnGlnVaXXl«S«rArgIl«TyrGlnThiAlA 

Alul BatNX 
I I 
2623 ACGCACTTACCTAGAGCTTGCAACATCAGGCAAGTTAGCATTTGCCCCTTCCAGAAGACCATGCCTGGG 2691 
TftrHiaLauProArgAlaCyaAsnXltArgGln^^ 

Hipl 

Hpall Xhol 
HaaXXX TaqI 

Apal Hint I Aval Accl 

2692 CCCGGCT7CTACTAGATTCCAAACGAAXATCC7CGAGAGTG7GTATACCACGGTGATATGAGTGTGG7T 2760 
FroGlyPheTyr • 

2694 2707 2724 2736 

2692 2725 

2694 2724 

2694 

Hindi RmX 
2761 GTTGATGTATG^TAACACTACATAGTCATGGTGTGTGTTCCATAAATAATGTACTAATGTAATAAGAAC 2629 
- 2771 2813 

Accl 

2830 TACTCCGTAGACGGTAATAAAAGAGAAGTTTTTTTTTTTACTCTTGCTACTTTCCTATAAAGTGATGAT 2898 
2838 
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Seal 
RaaX 

2699 7AACAACAGA7ACACCAAAAAGAAAACAATTAA7C7A7A7TCACAAT6AAGCACT^ 2967 

29S4 
2954 

Sau3AI 

2968 CA7G7CAGATTT7C77T77C7AAAXG7C7AAt7AAGCC77CAAGGC7AG7GATGA?AAAA^7CA7CCA 3C36 

3026 

Sau3AI Sau3AX 
BamHl Hinfl Bell 

I I I 

3C37 ATGG5ATCCAACAAAC3ACTCAAATCTGGTTTTGA*CAGATACTTCAAAACTATTTTTGTATTCATTAAA 3105 

3041 3053 3069 

3041 3069 

Hinfl 

3106 7XAXGCAAGXG7TC7777A777GG7GAAGACTCX77AGAAGCAAAGAACGACAAGCAGXAA7AAAAAAA 3174 

3135 

3175 ACAAAGT7CAG77T7AAGA7TTGTTA77GAC77A77G7CAT77CAAAAATA7AG7ATGA7AT7AA7A7A 3243 
3244 GTT77ATT7ATATAA7GC77G7C7A77CAAGAT77GAGAACATTAA7A7GATAC7G7CCACA7A7CCAA 3312 



Ndal . 

313 7A7AT7AAGT77CA777CTG77CAAACA7A7GA7AAGA7GG7CAAA7GA77ATGAG7T77GT7A7T7AC 3381 

3341 

TaqI SAU3AI 
AluZ Rial 

3382 C7GAAGAAAAGATAAG7GAGC77CGAG777C7GAA5GG7ACG7GAXCTTCA777CT7GGC7AAAAGCGA 3450 

3402 3421 
3405 3425 

3451 A7A7GACATCACCTAGAGAAAGCCGATAA7AG7AAACTC7G77C7TGGTT777GG7TTAA7CAAACCGA 3519 
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Mtpl 

MspZ Dd«I BpaXX 
Hptll AluX Udml HinfX 

I II I II 

3520 ACCGG7AGC7GAG7G7CAAG7CAGCAAACA7CGCAAACCA7A7G7CAA77CG77AGA77CCCGGT7TAA 3588 

3522 3528 3560 3576 

3522 3529 3581 

3581 

Mapx 

HpaXI 

I 

3589 G77G7AAACCGG7A777CA777GG7GAAAACCC7AGAAGCCA0CCANCC77T7XAA7C7AAX7777GCA 3657 

3598 
3598 

HinfX 
HincXX 
Dd«I BstKX 

I III 

3658 AACGAGAAG7CACCACACC7C7CCAC7AAAACCC7GAACC77AC7GAGAGAAGCAGAGKCA^?AAAGAA 3726 

3702 3718 
3715 
3714 

3727 CAAA7AAAACCCGAAGA7GAGACCACCACG7GCGGCGGGACG77CAGGGGACGGGGAGGAAGAGAA7GR 3793 



Avail 

Alul Avail 

II I 
3796 CGGCGG5MN777GG7GGCGGCGGCGGACG7777GG7GGCGGCGG7GGACGT777GG7GGCGGCGG7GGA 3864 

3804 3863 
3801 

IcoRV Av*II Ddel 

II I 
3865 CC777GG7GG7GGA7A7CG7GACGAAGGACC7CCCAG7GAAG7CA77GG77CG777AC7C7777C77AG 3933 

3880 3892 3930 
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Taqi Hindi XX 

flinli Alui Ddei 

II II I 

3934 TCGAATCTTATTCTTCCTCTGCTCCTTCTTTTACCGATAAXGCTTAACACTTTATTGATAAXCTTCTCA 4002 

3937 3976 4000 

3935 3974 

AluX XmnX Hinfl DdtX 

II II 
CCS GC777GAATGTGAA7GAAC7G777CCTGCT7A7TAG7G77CC7TTGTTTTGAGKGAA7CAC7G7C77A 4071 

4004 4023 4059 4069 

Hinfl 
I 

4C72 6CACTTTTCTTAGATTCATCTTTGTGTTTAAGTTAAAASGTAGAAACTTTCTGACTTGTCTCCGTTA7C 4140 
4085 

Hindi 
I 

4141 ACAAGG77AACTT7G77GG77A7AACAGAAGTTGCGACC777C7CCA7GCTTG7GAGGG7GATCC7G7G 4209 
4146 

Avail AluX DdtX 5au3AX 

III I 
4210 GACCAAGCTCTCTCAGGCGAAGATCCCTTAC7TCAA7GCCCCAATCTACT7GGAAAACAAGACACAGA7 4278 

4210 4217 4222 4231 

Sail 
PstX 

HindXIX BlncXI 
3au3AI AluX ACCI XCCRI 

4279 7GGGAAAG77GA7GAGATCCAAGC77GGGC7GCAGG7CGACGAA77C 4325 

4294 4302 4316 4321 

4300 4314 
4313 
4315 
4316 
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EA9-14 Linear LENGTH - 416 

ScrFI 

Maalll 
BpaXX 

CauXX Mnll HpaXI Nap 

III II 
1 CCAACCCSGTTACMTTC^AATACCGGGAGGXilXTATTGOTTTCCAATTCTATTCCTCCMTGTGTTCAGC 
7 21 59 71 

7 

9 

7 

Xxnalll 

Mm*I WDol 
Mbol HaeXIX 
DpnX Mill 
B»ml Bell MatUI Mrl Dpnx 

I t I I MM 

72 GGAGAATGCACAACTCATCTTGATCACGGGGTAACTCCGGTTGGATACGGCCGATCTAAAAACGGAT7AAA 14: 
62 93 102 120 126 

95 120 
93 122 
93 124 

120 

XhoII NlaXV 
Seal MltXV Nlaiil 

Mbol rinl NlallX 

Raal OpnX XcoRX Avail Mnll KboX 

Blnl BamHX Bin! MnlX Aaul MaaXX DpnX Bin! tco 

II I I I I I Ml II II I II 

143 GTACTGGATCCTCAAGAATTCATGGGGACCAAAATGGGGAGAACG7GGATACATGAGGATCAAAAAAGATA 213 
144 149 137 163 169 186 202 208 213 

143 131 159 169 190 200 

149 167 198 

143 131 187 
149 170 

Nlaxxx Rial 
BamI 

I I I 

214 TCAAGCCTAAACACGGACAATGTGGTCTTGCCATGAATGCTTCGTACCCAACTATGTg^AAAAATCGGTTC 2%\ 

233 ^ 
249 239 

HindlXI Nap (7324) X 

Hp a XX AluX Bla£X NUXXX 

I i I II, 
283 AATATCCGGTTAAGCTTTAGfiAI&&ftTGTGtGTGTXGGT7A7AA7TXAAGACTCTGT?GCA7GTAATTTGT 35 3 
291 299 335 348 

297 348 



FIGURE 3 
(Page 1 of 2) 



37 



EP 0 255 378 B1 



Sfatft 

336 GAAA7GSTAAG?TTXTG7GATGCAAAAGA7TTGA?m&A&AmM&AG0GSGSGaSCCG 416 
363 



Partial eequence o£ B. campestrii seed cONA EA9 sufficient 
to identify * genomic clone containing a proaotar. The poly- 
adanylation signal AATAAA ia underlined aa are the polyA tails. 
The itop codon of the presumed open reading frame ia double underlined 
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